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CHAPTER 1 - INTRODUCTION 
Background 

At 11:05 PDT, July 23, 1972, the first Earth Resources Techno- 
logy Satellite (ERTS-1) left the Western Test Range at Vandenburg Air 
Force Base near Lompoc, California atop a two stage Thor-Delta rocket. 
Approximately one hour later ERTS-1 was inserted into its final orbit 
915 km (570 miles) above the earth's surface, travelling in excess of 
26,000 km/hr (16,500 mph). 

The satellite circuits the earth 14 times a day in a near-polar 
orbit, allowing almost complete coverage of the earth's surface once 
every 18 days. ERTS-1 was to provide multispectral scanner (MSS) and 
Return Beam Vidicon (RBV) camera imagery, and collect remote ground 
station data. The numerical information was telemetered to ground sta- 
tions in Fairbanks, Alaska, Goldstone, California, and Greenbelt, Mary- 
land. Later, receiving stations were built in Canada, Brazil, and Italy. 

The satellite "was designed to demonstrate the feasibility of map- 
ping and monitoring earth surface features from space." (Reeves, 1975, 
pg 569). The spacecraff's design life was one year; in that year NASA 
expected the satellite would provide multispectral scanning and RBV 
camera imagery for research and evaluation in a variety of application 
discipl ines. 

ERTS-1 (later called Landsat 1) immediately developed problems with 
the RBV cameras, but the multispectral scanning system remained fully 
operational until the fall of 1977 when MSS band 4 (0. 5-0.6 urn) became 
inoperable. Complete instrument failure occurred January 6, 1978, about 
five and one half years after launch (Rohde, 1978). Since the summer of 
1972, two more earth resources satellites have been placed in orbit. 
Landsat 2 joined Landsat 1 on January 22, 1975, and Landsat 3 was launch- 
ed March 5, 1978. Landsat 2 and 3 currently provide repetitive MSS cov- 
erage of almost every spot on the earth's surface once eve.'y nine days. 
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In It's first year of operation. Landsat 1 provided "complete, 
cloud free coverage of the United States; cloud free coverage of a size- 
able percentage of the remaining land surface, polar, and coastal areas 
of the earth; and repetitive coverage ... which shows significant tem- 
poral changes In the United States and other land areas of the world." 
(Reeves, 1975, pg 570). 

Much research has been conducted In the last six years documenting 
the abilities and limitations of manual Interpretation and computer- 
aided analysis of the Landsat data. NASA has sponsored Investigations 
of potential Imagery uses In the United States and fifty two other 
countries Landsat data has been used to delimit. Inventory, and/or 
detect changes on forests, agricultural lands, rangelands, wetlands, 
surface and near-surface marine environments, urban and rural areas. 
Agricultural applications Include the use of Landsat MSS Imagery to In- 
ventory and predict crop acreages and yields. Extensive soil surveys 
have been compiled In Indiana and Missouri. Rangeland uses include 
Identification and monitoring of areas prone to overgrazing and fire. 
Forestry applications Include Incorporation of Landsat data Into a multi- 
stage sampling system which provided estimates of timber volume, total 
timber resources present, and timber stand conditions on a site In 
northern California. Landsat has helped to define the extent of clear- 
cutting In Oregon, and Canadian foresters have used the Imagery to de- 
lineate burned areas In northern Saskatchewan. Spacecraft data have 
been used by the emerging third world nations to produce extensive In- 
itial Inventories of their undeveloped, unexplored lands. Brazil Is 
taking advantage of Landsat's repetitive coverage to study and control 
the development of the Amazon forests. Many studies have found the date 
useful In delineating flooded areas, locating bodies of water, and map- 
ping snowpack in order to predict runoff. Landsat data have been used 
to locate near-surface groundwater sites by Interpretating 1 Inament pat- 
terns, surface lithology, vegetation, and geomorphic properties of an 
area. 

The Landsat MSS Imagery lends Itself to small scale cartographic 
projects since It provides a synoptic view over a wide area from a near 
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vertical angle and maps may be produced soon after data acquistlon. The 
U.S. Bureau of Census has mapped rural and urban boundaries near densely 
populated areas* and monitors the boundaries for significant changes. 

The scanner's ability to penetrate clear water to depths up to 20 meters 
(65 feet) enabled cartographers to chart shallow underwater features In 
the Caribbean. 

In short, the MSS data hold a wealth of Information If Interpreted 
properly and If the limitations of the data are recognized. Interpret- 
ation may be facilitated by taking advantage o; computer-aided analysis 
techniques (CAAT). 

Computer-Aided Analysis Techniques 

Most classification procedures demand that training statistics be 
developed. LARS paesonnel have found that a procedure combining clus- 
tering (unsupervised) and supervised techniques Is the most effective 
method of formulating the training statistics. This approach, called 
the Multicluster Blocks (McB) has been shown to be more accurate and 
cost effective than the commonly utilized supervised method of trolning 
(Fleming and Hoffer, 1977). The McB approach formulates training stat- 
istics by clustering relatively small, heterogeneous areas (approxi- 
mately 1600 to 3600 pixels per block). The spectral classes formed In 
each block are Identified using photointerpretive techniques. The 
statistics are merged to form the final training deck used by the class- 
ifier. Classification performance depends heavily on the analyst's 
ability to correctly Identify the spectral classes and to properly merge 
those classes. Spectral class Identification In turn, depends on the 
photointerpretive capabilities of the analyst or on the availability of 
ground Information (such as type maps) on the training blocks. 

A software systmn (EODLARSYS) was recently developed which has the 
capability of reducing analyst Involvement In the development of train- 
ing statistics. A subset of this system, a collection of processors 
called Procedure 1, more fully automates the classlflcatioii process. In 
Procedure 1, once training points of known Identity are selected and de- 
fined to the computer, various processors are employed to 1. compile the 
training point Information; 2. cluster the scene; 3. label the clusters 
by comparing cluster statistics with the dot Information; 4. classify 
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the scene using the labelled cluster statistics; and 5. display the 
scene and estimate classification performance (see Figure 1.1). 

The Multicluster Blocks approach has been successfully used to 
develop training statistics to accurately classify forested areas 
(Hoffer, 1975; Fleming and Hoffer, 1977). Analysts have demonstrated 
the abilities of the Procedure 1 processors on agricultural areas 
(MacDonald, 1976) but the processors have not been tested In a forest- 
land situation. 

The Interest In Procedure 1 stems from the f^.ct that It can direct- 
ly utilize forest Inventory data that may currently be available on many 
private, state, and national forests. For Instance, Washington and 
Minnesota use a statewide network of field checked points to Inventory 
their forest lands. This readily available gorund Information may be 
located In the Landsat scene. Once this Inventory Information Is Input, 
demands on the analyst are minimal; the classification procedure Is 
fully automated. Multicluster Blocks does not have this capability. 

McB may use the Inventory Information to help Identify the spectral 
classes In each training block, but the procedure remains heavily de- 
pendent on the abilities of the analyst. This dependency Is most acute 
In the statistics merging phase where the analyst actually constructs 
the final training statistics deck. Those who have had to merge stat- 
istics decks know that formation of the final deck may be viewed as an 
art, one which mirrors the analyst's experience and Intuition. Proced- 
ure 1 removes this source of analyst Interaction. 

The overall objective of this study then Is to compare the Multl- 
cluster Blocks approach to Procedure 1. The methods use different clus- 
tering processors to obtain training statistics. These statistics are 
used by th«> Individual classification processors to categorize the area 
of interest into Informational classes. The clustering processors and 
classification processors used In McB and P-1 are compared. Figure 1.2 
depicts the experimental design of the study. 

To summarize, the overall objective of the study Is to compare the 
McB approach for developing training statistics to Procedure I's 
approach. This comparison is made keeping In mind that P-1 has Inherent 
operational advantages which make It attractive In situations where 
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Figure 1.1 Comparison of the Procedure 1 and Multicluster Blocks approaches to 
classification of forest lands. 
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1. The first three P-1 processors are used to develop training stat- 
Isltcs - OOTDATA, ISOCLS, and LABEL. 

2. This figure represents only a partial listing of the results obtain- 
ed In this study. A more thorough explainatlon Is given In the 
Procedures section. 
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forest Inventory data Is available. Within this framework, a number of 
subobjectives may be listed. 

1. A parameter study Is required In order to formulate a para- 
meter set which allows efficent clustering of Landsat MSS 
forestland data. The study Is necessary because little work 
has been done with P-1's clustering processor ISOCLS to dis- 
cern the effects of Its control parameters In a forested 
situation. 

2. The clustering processors, ISOCLS and CLUSTER (used In P-1 
and McB respectively), are compared to determine which works 
most efficently and produces the most accurate set of train- 
ing statistics using Landsat MSS forestland data and forest 
Inventory data. 

3. Two classification processors are compared, one a LARSYS 
routine associated with the McB approach, the other an 
EODLARSYS processor associated with P-1. 
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CHAPTER 2 - LITERATURE REVIEW 
Current Forest Survey Methods 

The necessity for accurate, up-to-date resource Information Is 
highlighted by a number of federal laws passed In the last two decades. 
Natural resource legislation Includes the Multiple Use-Sustained Yield 
Act of 1960, the National Environmental Policy Act (NEPA) of 1970, and 
the Resources Planning Act of 1974 (RPA). 

Of the three, the RPA has the most significant Impact on forestry 
and remote sensing applications. The legislation specifies that the 
Forest Service assess all forest and rangeland resources In the U.S. 
and develop long range resource plans based on these assessments. As- 
sessments arc to be made every 10 years beginning In 1979, and the re- 
source plans drawn from the assessment would be responsive to changes 
anticipated In the coming decade (Glascock, 1976). 

The RPA Is a landmark piece of legislation, and demands the most 
Intensive Information of our natural resources. Essentially, the law 
demands a full national forest Inventory once every 10 years. The In- 
ventory process may be made more timely and accurate by using Landsat 
data to supply some of the necessary Information In a cost effective 
manner (Nichols, 1974; Heller, 1976). 

Current forest Inventory systems generally use ground Information 
(permanent plots or sample plots) alone or In conjunction with aerial 
photography to evaluate their resources. Aerial photos are commonly 
used to stratify a forest. These strata are then sampled on the ground 
to obtain more detailed Information (Husch, Miller, and Beers, 1972). 

Such sampling designs (one and two stage) are In use throughout 
the country. The State of Maine used a combination of techniques. In- 
cluding remeasured plots and an Independent two stage (ground and air- 
photo information) sample to reinventory their lands in 1970 (Ferguson 
and Kingsley, 1972). Colorado's first statewide Inventory used a two 
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stage design (Miller and Choate* 1964) as do the National Forests 1o> 
cated In Colorado (Born, 1977; USFS* 1978; Matteson, 1978). Two stage 
designs have been used In Minnesota (USFS, 1974) and In Missouri 
(Spencer and Essex, 1976). 

Recently, Landsat data have been Incorporated Into the forestland 
sampling techniques, thus forming a three or multistage design. The 
Landsat data may be used to Initially stratify the area of Interest. 
Airphoto and ground samples may then be allocated within the strata of 
Interest In order to adequately characterize the forestlands. 

Landsat data was used to stratify areas In the western part of 
Washington Into broad density and cover type classes (Harding and Scott, 
1978). A multistage technique was also used on the Plumas National For- 
est (Nichols, 1974). Both studies concluded that the use of Landsat 
data saved considerable money that otherwise would've been spent on 
ground and/or airphoto sampling. Nichols maintained that accuracy was 
not sacrificed by Including the satellite platform data. 

The RPA specifies that the U.S. Forest Service must reinventory Its 
lands once every ten years. Landsat data, current forest survey Infor- 
mation, and machine processing may help achieve this goal. Analysis 
methods available to the Landsat data user are briefly reviewed In the 
following section. 

Analysis of Landsat Imagery 

The numerical data telemetered to earth stations by Landsat may be 
used for both qualitative and quantitative analyses. At one extreme the 
reflectance values may be assembled to produce a reconstructed Image of 
the scene viewed by the satellite. This visual model may then be photo- 
interpreted. At the other extreme Is a hypothetical system which Inputs 
the Landsat data, formats It, analyzes It, and outputs desired land use 
statistics, all without the analyst ever seeing the scene Image. Be- 
tween these extremes lay many documented techniques which facilitate ex- 
traction of information from the data. 

Visual Interpretation of Landsat Data 

Landsat data may be used to form a photographic Image of the scene 
called a color composite (bands 4, 5, and 7), or black and white Images 
can be constructed using Information from a single band (usually band 


5). These Images are used 

1. to give the analyst a better feeling for the area with which 
he is working; 

2. to discern cloud cover conditions; 

3. to broadly stratify a scene into smaller areas of interest. 

Howard (1976) and Jaakola (1976) used color composites and photo- 

interpretation techniques to separate forested and nonforested areas. 

Both divided their forestlands into broad subclasses. Jaakola concluded 

"This kind of satellite image analysis* as supported by a 
fairly small amount of surface observations* gives reliable 
acreage estimates which can be utilized directly in planning 
the uses of the resources* or* indirectly* in stratifying 
the area for more detailed forest inventories." 

Heller (1976) studied the different formats available and found 
that forest managers of large ownerships (greater than 4*000 hectares) 
could benefit by using Landsat enlargements for planning. He found 
1:250*000 color composite Imagery most useful. Comparing machine pro- 
cessing and photointerpretation p>-ocedures* Heller et al. (1975) stated: 

"Classification can be done most effectively by computer* 
but photointerpretation produces equally accurate results. 

Choice would depend on availability of trained people and 
equipment." 

Visual interpretation of the Landsat imagery gives the user a 
broad overview of the study area, and may be used for type mapping and 
planning. However* machine processing of the Landsat data has a major 
advantage in that it is a repeatable sequence. Classifications are 
consistent since they're based on mathematical precepts, though the 
method used to develop the training statistics can significantly in- 
fluence the quality of the classification. Photointerpretation is sub- 
jective; given the same interpreter and the same study area on two 
different days, the type map will be drawn differently. Hence photo- 
interpretation of the Landsat data disregards the data's most important 
attribute, its numerical characteristics. 

Machine Processing - An Overview 

Machine processing of Landsat data demands that the computer be 
'taught' to recognize Informationally important spectral classes. The 
computer might be programmed to define spectral classes using ground 
data selected on the basis of informational importance (supervised 
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technique) or a scene could be clustered Into a given nuiiiU'r of s|hh- 
tral classes (cluster analysis or unsupervised technique). Those 
classes are then Identified using ancillary Information. Each method 
Is overviewed below so that the reader might become more acquainted 
with the differences between the two. Studies using each of the tech- 
niques are reviewed In subsequent sections. 

Supervised Development of Training Statistics - An Overview 

Much of the multi spectral classification work done today utilizes 
the supervised method to develop the training statistics for the class- 
ifier. A supervised approach to developing training statistics Is the 
more understandable, straightforward method of the two (supervised and 
unsupervised) considered. Essentially the analyst selects areas of In- 
terest, tells the computer of their location and Identity, then com- 
mands that the computer group all resolution elements In a scene Into 
one of the appropriate classes specified by the training areas. Though 
perhaps more logical, the supervised approach Is dependent upon the 
ability of the analyst to define spectrally separable Informational 
classes. If the classes defined are not spectrally separable, cross- 
classification will result and accuracies will suffer. 

A supervised approach to developing training statistics is a log- 
ical sequence of events, given below: 

1. Specify the informational classes of interest. 

2. Locate homogeneous training fields in each of the infor- 
mational classes using ancillary data. 

3. Formulate the statistics for each informational class. 

4. Evaluate the statistics and subdivide those spectral classes 
which are multimodal. 

5. Use these statistics to classify the area of interest. 

6. Evaluate the classification. 

Supervised classification techniques Involve the use of training 
points or fields (ground information) which Ideally Include all pos- 
sible spectral variations for a particular cover type (in a given 
scene). In order to run an accurate classification using a supervised 
approach, the following guidelines have been defined (Ellis, 1978): 

1. Classes (informational) must be as spectrally distinct 
from each other as possible. 
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2. Training fields should be as homogeneous as possible. 

3. Bimodal (mixed) spectral classes should be avoided. 

4. Training fields should be representative over the entire 
area classified. 

5. Training fields should be 40 acres In size or greater (ap- 
proximately 36 pixels). 

It should be noted that the training field size criterion was suggested 
In relation to a shrublands classification study undertaken In the cen- 
tral Rocky Mountains and on the Colorado Plateau. Training field size 
may vary according to the complexity of the study area; the more com- 
plex the area, the smaller the size of the training fields. 

Ellis found two common problems Inherent In the supervised class- 
ification procedure: 

1. The analyst does not Identify and define Important spectral 
classes, thereby confusing and decreasing the effectiveness 
of the classifier, or he Identifies an Informational class 
that Is not spectrally differentiable. 

2. Within an Informational class, training sites selected do 
not adequately characterize that class. 

Despite problems and limitations, the supervised classification 
technique has been used successfully by Individuals classifying for- 
ested and nonforested areas. 

Unsupervised Development of Training Statistics - An Overview 

An alternate approach to developing training statistics reduces 
analyst Involvement in the definition of the spectral classes. The 
unsupervised approach to developing training statistics Involves a clus- 
tering processor which groups the data Into spectrally homogeneous 
classes. These spectral classes are then Identified using any pertin- 
ent Information available to the analyst, such as airphotos of the 
clustered area, vegetation maps, topographic maps, or ground checked 
points or fields. The analyst must specify the number of spectral 
classes to be output by the cluster processor. If one specifies too 
few clusters, the spectral classes will be multimodal, have large var- 
iances, and may not satisfy the maximum-likelihood classification pro- 
cessor assumptions. If too many classes are specified, one Introduces 
noise Into the system and the analyst may have problems Identifying 
the output clusters. 
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A sei^uence of steps may be outlined for an unsupervised development 
of training statistics: 

1. Cluster the study area. 

2 . Identify each spectral class formed using am i nary data. 

3. Use the labelled cluster statistics to classify the area ol 
Interest. 

4. Evaluate the classification. 

If the 'proper* number of spectral classes Is output, the clusters 
formed should be unimodal. If multimodal clusters appear, the area 
should be reclustered to produce more spectral classes. On the other 
hand. If too many spectral classes have been formed: 

1. Identification of the spectral classes may be difficult; 

2. needless computer time would've been spent clustering; 

3. needless computer time would be spent classifying the 
area; and 

4. those spectral classes that are not spectrally separable 
should be merged. 

The supervised method demands that the ancillary Information be 
used jflrst to define areas of Interest. The analyst anticipates that 
the classes of Interest are spectrally separable. The unsupervised 
approach clusters the data points Into spectrally separable groups, then 
uses the ancillary Information to Identify them. These methods may be 
combined and altered to produce different methods of developing training 
statistics. One of these, the Multicluster Block method, Is discussed 
In future sections. 

Results of Studies Using the Supervised and 
Unsupervised Techniques 

Studies Using the Supervised Method of Developing Training Statistics 

Investigators have used Landsat (4 channel), Skylab (13 channel), 
and aircraft (variable number of channels) data to classify forestlands. 
In addition, multitemporal overlays have been used in some of the stud- 
ies. These data sets incorporate multidate acquisitions over thf? saiiu! 
study area. The acquisitions are precision registered and analyzed, 
the hope being that the additional information will increase the classi- 
fication accuracy. 


Initially, machine processing of multlspectral Information made 
use of aircraft data, since satellite platform Imagery was not avail- 
able to the public prior to 1972. Smedes et al. (1969), Rohde and 
Olsen (1972), and Coggeshall and Hoffer (1973) successfully classified 
forested arear using aircraft data, Smedes used the best fou»* channels 
of 17 channel MSS data flown over Yellowstone National Park. He devel- 
oped training statistics for eight land-cover types and Judged the 
classification accuracy to be SS/K, Three years later, using 6 bands of 
low altitude aircraft data (0.4 to 1,0 um wavelength region), Rohde and 
Olsen analyzed pine, spruce, red oak, white oak, black walnut, black 
locust, and sugar maple plantations. Accuracy of identification was 
estimated to be 85%. Coggeshall and Hoffer experimented with different 
combinations of 12 channel multlspectral scanner data and concluded that 
accurate classification of deciduous and coniferous forests (overall 
accuracy, 80.5%) could be obtained using visible wavelength bands and 
either near or middle infrared bands. 

Investigations using satellite platform Imagery began soon after 
the launch of ERTS-1 (Landsat 1) In July, 1972. A number of studies 
have been completed outlining the capabilities of single acquisition 
Landsat data and the supervised technique for developing training sta- 
tistics. Nichols (1974) used ERTS-1 data as the first stage of a multi- 
stage sampling scheme on the Plumas National Forest In California. The 
predominantly old growth forest was stratified Into subclasses related 
to timber volume using photointerpretation and computer processing of 
the ERTS data. The multistage approach resulted In equal precision of 
estimation of timber volume at a 44% cost savings when compared to con- 
ventional methods. The timber Inventory Incorporating the Landsat data 
permitted estimation of the following parameters; 

1. number of trees per acre; 

2. square foot basal area; 

3. basal area growth over a five year period; 

4. cubic foot volume; 

5. Scribner board foot volume; 

at a cost of $0. 072/hectare ($0. 029/acre) (Glaldinl, 1975), 
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Other studies Incorporating Landsat data and a supervised approach 
Include Kan and Dlllman (1975) who reported 70-80 percent accuracies In 
separating hardwoods, softwoods, and regeneration; and Bryant, Dodge, 
and Warren (1977) who concluded that their supervised classification of 
hardwoods, mixed woods, conifers, and boglands showed large discrep- 
ancies when compared to a Seven Islands Company Inventory. Another nega- 
tive note concerning the usefulness of Landsat maps generated using 
supervised techniques was sounded by Mead and Meyer (1977). They ran a 
number of classifications using supervised training areas In Itasca 
County, Minnesota. They attempted to break out 11 land use categories - 
water, lowland conifer, upland conller, mixed forest, bursh and shrub, 
grassland and open, agriculture, mixed lands, sedge meadows, urban, and 
sphagnum/1 eatherleaf. Their results are best summarized In the final 
paragraph of their report: 

"Evaluation of the various map solutions by experienced 
field resource management cooperators resulted In the judge- 
ment that classification accuracies were so low as to preclude 
practical use for their purpose at this time." 

Classification results may be Improved by Increasing the amount of 
Information available to the classifier. This additional Information Is 
digitized (If necessary) and precision registered with the existing 
Landsat data base. The additional Information Is added as new channels 
of data which may contain: 

1. soils or parent material data, 

2. topographic data, 

3. multlspectral scanner data, 

4. planimetric data (land use or ownership Information). 

Several studies have utilized temporal (multidate) overlays and 

have found that the supplemental Information Increases classification 
accuracies. Williams (1976) ran a supervised classification on 24,300 
hectares of commercial forestlands In North Carolina's Southern Pine 
Region (see also Williams and Haver, 1976). The area runs the gamut 
of forestland cc.idltlons, containing clearcuts, various regrowth stages, 
and artificial and natural pine regeneration. Using Penn State's 
ORSER system (Office for Remote Sensing of Earth Resources), principal 
components analysis, and winter-summer data temporally overlaid, he 
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attempted to stratify the southern pine forest by crown closure. Re- 
sults showed a 94X agreement between computer and photointerpretation 
In breaking out hardwood stands, 96% agreement In pine stands, 54% 
agreement on clearcuts (major confusion class, pine regeneration), 
overall 90%. 

In northern Italy, Lapletra and Negler (1976) used Landsat NSS 
data to estimate acreages of poplar plantations. Using four channels 
(single acquistlon), the Investigators achieved acreage estimation ac- 
curacies of 80%. Using three acquisitions (12 bands) and principal 
components analysis to reduce 12 bands to 6, accuracies jumped to 95% 

In the best case. 

Kalensky and Wightman (1978) overlaid four Landsat acquisitions; 
the results of their Investigation and a number of earlier studies^ 
are shown In Table 2.1. The classifications were done using the super- 
vised approach on the MICA system (Modular Interactive Classification 
Analyzer - developed at the Canada Centre for Remote Sensing, Ottawa, 
Ontario). Note that the average map accuracy (which takes Into account 
the spatial displacement of a spectral class) was 13.6% lower than the 
average classification accuracy. The significant decrease In map ac- 
curacy may be due to 

1. the effects of a 1.1 acre resolution element upon map ac- 
curacy; 

2. possible data registration Inaccuracies; 

3. Inaccuracies In the base map used as 'ground truth'. 

Kalensky and Nightman (1978) found the most accurate supervised class- 
ification was obtained using multidate Imagery, bands 5 and 7. The 
second most accurate classification used three acquisitions - 12 chan- 
nels - and the third most accurate classification used four acquisitions 
- 16 channels. 

To summarize, the supervised technique has the ability to produce 
fairly accurate classifications (80-95%) In forested areas. Accuracies 
Improve If multidate overlays are used. Unfortunately, precision 

1. The studies were done at the Forest Management Institute of the 
Canadian Forestry Service between 1974 and 1978. 
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Table 2.1 Classification and napping 

cessing system (from kalensky and U1ghtman« 


accuracies of a digital Image pro' 


CLASSIFIER 

INPUT IMAGERY 
LANDSAT-l/MSS 

1 CLASS 1 

Agricultural 

land 

Coniferous 

forest 

Deciduous 

forest 

Classification Accuracy/Mapping Accuracy 
(%) 

Max. -likelihood 
single-date 
B4. BS, B6, B7 

Sept. OS, 1972 

87/73 

87/76 

78/6S 

June 03, 1973 

64/48 

7S/64 

7S/S2 

Oct. 06, 1973 

70/S3 

70/64 

69/40 

March 18, 1973 

94/79 

7S/66 

70/Sl 

Max. -likelihood 
nultidate 
B4, BS. B6, B7 

Sept. OS, 1972 
March 18, 1974 

68/76 

88/74 

74/63 

Sept. OS, 1972 
June 03, 1973 
Oct. 0<», 1973 

96/77 

83/77 

84/6S 

Sept. OS. 1972 
June 03, 1973 
March 18, 1974 

91/78 

87/74 

76/63 

Sept. OS, 1972 
June 03, 1973 
Oct. 06, 1973 
March 18, 1974 

92/79 

8S/76 

84/67 

li-L single-date 
BS. B7 

Sept. OS, 1972 

8S/74 

87/7S 

78/6S 

Max. -likelihood 
multidate 
BS. B7 

Sept. OS. 1972 
June 03. 1973 
Oct. 06, 1973 

97/78 

8S/78 

8S/6S 

Image 100 
rectangular 
parallelpiped 
B4, BS, B6, B7 

Sept. OS, 1972 

61/38 

82/78 

80/64 



overlays are expensive.^ The supervised technique has tMO major draw- 
backs. First, the Informational classes designated by the analyst may 
not be spectrally separable, leadfig to classifier confusion. Second, 
the ii'ront end of the process Is analyst Intensive. The unsupervised 
technique minimizes these drawbacks. The results of studies using this 
technique follow. 

Studies Using the Unsupervised Hethod of Developing Training Statlsitics 
Studies using an unsupervised approach are not widespread, and most 
of the research has been conducted at the Laboratory for Applications of 
Remote Sensing (LARS), Purdue University, In Indiana. 

1975 saw the culmination of a two year study of ERTS-1 data cap- 
abilities In mountainous terrain In Colorado. Both supervised and un- 
supervised approaches v«;re tested to discern which method was best for 
producing training statistics. The study found that the modified clus- 
tering technique (later called the Nulticluster Blocks technique) yield- 
ed consistently higher classification accuracies. By using one tenth of 
one percent of the total data set for training, a 94% correct Level II 

9 

classification performance was obtained (Hoffer et a1., 1975). 

Fleming, Berkeblle, and Hoffer (1975) studied four different un- 
supervised analysis approaches on the Ludwig Mountain quadrangle In 
southwestern Colorado. Of the four methods studied on the 15,140 hect- 
are study site 

1. unsupervised cluster, 10 spectral classes; 

2. un:;'iperv1sed cluster, 20 spectral classes; 

3. modified supervised; and 

4. modified cluster, now called Multicluster Blocks; 

the modified cluster analysis yielded the highest accuracy (76.6, 78.5, 
70.0, and 84.6 percent, respecively). 

Intensive Study Sites (ISA) In the western sections of the state 

1. Precision registration may become economically more attractive once 
NASA begins geometrically correcting the Landsat data prior to sale. 

2. Levels of classification detail used In this study (more complete de- 
finitions available In Anderson et a1., 1976): 

Level I: forest, grassland, barren, water. 

Level II: conifer, deciduous, grassland, barren, and water. 
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of Washington were selected In order to Inventory Washington's public 
lands (Edwards, 1977; Harding and Scott, 1978). Each ISA was approxi- 
mately 16 by 20 miles, 512 by 512 pixels. They first tried an unsup- 
ervised classification on a few large blocks within each ISA, but due to 
the spectral complexity of the blocks they were unable to accurately 
Identify the 55 to 65 spectral classes output. Theli alternate analy- 
sis procedure Involved using training blocks to develop the training 
statistics for pure stands of hardwoods and softtmods (supervised). 

Nixed stand training statistics were developed using an unsupervised 
approach since the analysts found It Impossible to select 'typical* 
stands necessary to characterize mixed stands. The combined training 
statistics yielded 279 separate spectral classes(8 Landsat scenes used) 
which were ultimately grouped Into six broad cover types. No accuracy 
statmnent was available for the classification, though blocks of cla s- 
Iflcatlon results were scrutinized by a Resource Analyst and accepted 
If, In his estimation, they were adequate. 

Fleming and Hoffer (1977) compared six different methods of devel- 
oping training statistics, ranging from a purely supervised technique to 
a purely unsupervised technique and found that a Nulticluster Blocks 
approach ^(A) reduced the CPU time, (B) required relatively few man 
hours of time, (C) utilized the lowest amount of supprot data, and (D) 
yielded the highest overall classification accuracy." They also con- 
cluded that the "supervised approach was the most Ineffective method of 
developing training statistics ...”. 

Conflicting results were obtained by Schubert (1978). He studied 
three methods of classifying tracts In central Alberta. Unsupervised ac- 
curacy of classification was 77.3% for all classes (10 land use classes), 
supervised - 78.3%. ^t cost three times as much to develop the unsup- 
ervised statistics, d:>e In part to the novel method used to generate 
them. Additional classifications were run 40 miles north of the orig- 
inal study area. In a different soil-climatic zone. Here the supervised 
classification accuracies fell below the unsupervised. It Is Interest- 
ing to note that the most accurate method (83%) was an automatic classi- 
fication system employing ratios and previously established ratio sig- 
natures. 



20 


This review of the literature did not provide a clear answer to 
the question of which technique (supervised or unsupervlsed) Is better. 
The many studies come up with different conclusions. A supervised ap- 
proach might be considered appropriate for simplistic situations such 
as agricultural settings since spectral variability within an Informa- 
tional class Is limited by the homogeneous nature of the crops. An un- 
supervlsed approach lends Itself to continuously changing, complex situ- 
ations such as mixed forestlands where formation of the spectral class- 
es might best be left to the computer. 

A number of unsupervlsed methods of developing training statistics 
exist. Work by Koffer et al. (i975), arJ Flmlng and Hoffer (1977) have 
shown the Nultl cluster Blocks approach to be most efficent. However 
this method was not developed to optimize the use of sampled ground In- 
formation available on federal, state, and private (commercial) forests. 
The EOOLARSYS software system, specifically those processors used In 
Procedure 1, was designed to accomodate such 9 ' Id data. The five pro- 
cessors of Interest (OOTOATA, ISOCLS, LABEL, CLASSIFY, and DISPLAY) 
have not been tested on forest cover types. The five are evaluated In 
this study, and the three responsible for developing training statis- 
tics (OOTOATA, ISOCLS, and LABEL) are compared to the Hulticluster 
Blocks approach. 

Two Unsupervlsed Approaches - Results 

The purpose of any classification scheme Is to output a product 
of some use to tr.e analyst or his clients. Products may vary from 
tables of acreage statistics to vegetation or land use maps at selected 
scales. In order to output these products, a classifier must be stat- 
istically trained to recognize Infonntlonally Important spectral class- 
es. Two methods of developing these training statistics are evaluated 
In this study. The actual processors Involved In the study are reviewed 
In the Materials section; studies Involving the two approaches are re- 
viewed below. 

Multicluster Blocks 

The methods Involved In developing training statistics using the 
McB approach are described In the Procedures section. Following are 
some of the results obtained using this approach. 
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The Multicluster Blocks procedure has been developed and tested by 
Mike Fleming In a number of studies located In southwestern and north 
central Colorado (Hoffer et a1.» 1975; Fleming and Hoffer, 1977; Hoffer 
and Fleming, 1978; Krebs and staff, 1976). All of these studies utiliz- 
ed ERTS-1 data acquired In 1973, and many of the studies have been un- 
dertaken In cooperation with INSTAAR (Institute of Artie and Alpine Re- 
search, University of Colorado). 

Hoffer and staff (1975) reported an average level II classification 
accuracy of 94.5 percent on the Vallecito Intensive study area. Class- 
ification (test field) accuracies ranged from 85.4 percent (deciduous) 
to 100 percent (water). Similar results were obtained on other Inten- 
sive study areas throughout Colorado. 

Another study (Krebs and staff, 1976) attempted to classify the 
Platoro Quadrangle (southwestern Colorado) Into community types, spe- 
cifically 


1. 

dense conifer. 

5. 

coniferous-deciduous mixes 

2. 

sparse conifer. 

6. 

various grassland types. 

3. 

aspen. 

7. 

bare rock. 

4. 

oak. 

8. 

water. 


Using the 'narrow* definition of the cover types, ^ overall classifica- 
tion accuracy was 69%, with bare rock classified most inaccurately, 43.4 
percent. A 'moderate* definition of community types yielded an 82.7 per- 
cent overall accuracy, with accuracies ranging from 61.2 percent (sparse 
conifer) to 95.8 percent (coniferous/deciduous mixture). Using the 
classification products output (narrow definition) the U.S. Forest Ser- 
vice ran their own evaluation. From Kreb's report (pg 74) comes the 
f ol 1 owi ng : 

"The gut level feelings of those working with the classifi- 
cation were that the data was good, but accuracy figures of 
40 to 60 percent doomed any attempts for the U.S. Forest 

1. Spectral classes obtained during clustering were assigned to a com- 
munity type based on the type's definition. Use of the narrow defi- 
nition indicates that, for instance, those spectral classes found 
only in a coniferous/deciduous mix would be called a coniferous/de- 
ciduous spectral class. A moderate definition of the c/d mix (a 
predominately coniferous mixture) would allow both dense conifer 
and c/d spectral classes to be assigned to it. 
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Service to actually use the data In planning efforts." 

The study found Landsat data useful In classifying an area to Level II, 

but discovered community levels could not be delineated adequately. 

The Multicluster Blocks technique, at first called the Modified 

Cluster Technique, has also been used with Skylab data. 

"Classification of Skylab MSS data resulted In a classifi- 
cation performance of 85.0% for the major cover types, which 
was somewhat less accurate than the Landsat classification, 
probably due to the poor quality of the Skylab data." 

(Hoffer and Fleming, 1978) 

This report goes on to state that attempts to map forest cover types 

(Level III) using Landsat MSS data Indicate 

"... that results at this level of detail would probably 
not be accurate enough to provide useful Information for 
most users.". 

The computer work In the four studies mentioned above were done by 
the same analyst, Mike Fleming. The current study serves not only as a 
comparison between the Procedure 1 processors and the Nulticluster 
Blocks technique, but it will also test the reproducibility of the McB 
technique by a different analyst using data from the same general area. 

Procedure 1 

The EODLARSYS software package has 13 different processors avail- 
able to the user. Only five of the processors concern this study, and 
only three of the five are Involved In developing training statistics. 

In reviewing the literature, the term Procedure 1 Is often encountered, 
and Its definition varies. In Its narrowest context. Procedure 1 refers 
specifically to methods of operation utilized In LACIE (Large Area Crop 
Inventory Experiment) where 

1. random points are located In the Landsat scene and Identified 
using ancillary Information; 

2. a portion of these dots are used to seed the clustering 
processor ISOCLS; 

3. all processors - DOTDATA, ISOCLS, LABEL, CLASSIFY, and 
DISPLAY have fixed parameter levels. 

For the purposes of this report, the following definitions suffice: 
Procedure 1: Points 1 and 2 above are valid, howefer processor para- 
meters are not strictly defined. 

Self-Start Procedure 1: Same as the definition above, with the 
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exception that the clustering processor ISOCLS Is not seeded with dots. 
It self-starts. 

Most of the work using the P-1 processors has been done In agri- 
cultural areas under the auspices of LACIE. No work has been done In a 
forested area, though Reeves (1978) successfully differentiated range- 
land from nonrangeland with probability of correct classification rang- 
ing from 93 to 100 percent. LACIE used the P-1 software system to mon- 
itor worldwide wheat production. The estimates were produced using re- 
peatable procedures and were not revised using other Information 
sources. The users established a 90/90 percent accuracy criterion; 

I.e., their regional or countrywide wheat harvest estimates should be 
within 10% of the actual harvest taken for that region 90% of the time 
(9 times out of 10 years). The LACIE studies showed that the analysis 
procedures are capable of meeting the performance criteria (MacDonald, 
1976). 

Procedure 1 software was developed for LACIE. The system Is in- 
tended to monitor large agricultural areas where monotypic, regularly 
shaped crops predominate. The developers (Lockheed, under NASA con- 
tract) have made no claims about P-1's ability to accurately classify 
heterogeneous lands. Extension of P-1 Into forested areas may not 
yield results as accurate as those specified for LACIE. 
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CHAPTER 3 - MATERIALS 
Study Area 

The study areas are located In the eastern half of the 844,000 
hectare (2.086,484 acre) San Juan National Forest In southwestern 
Colorado (see Figure 3.1). The two study areas were chosen on the 
basis of 

1. heterogeneity of cover types and topographic variation, 

2. presence of locatable features, and 

3. availability of reference data. 

The San Juan Mountains are a rugged mixture of sedimentary rocks 
and tertiary volcanics which supported many silver mines near the turn 
of the century. Most mines have been abandoned and many mountain sides 
are pockmarked with sterile mine tailings and disintegrating buildings. 
Elevations in the San Juans range from approximately 2100 meters (about 
7,000 feet) to over 4270 meters (14,000 feet) along the Continental 
Divide. Most of the areas below treeline (approximately 3350 meters) are 
heavily forested, with species distribution dependent upon elevation and 
aspect of the locale (see Fleming and Hoffer, 1977). Grasslands are pre- 
valent in valley bottoms, and are often used for pasture. Extensive 
fields of hardy grasses and wildflowers dominate much of the tundra 
(above treeline). Mans activities manifest themselves in man-made reser- 
viors, clearcuts, ha> farming or pasturelands at the lower elevations, 
and four wheel drive trails crisscrossing forest and tundra. 

The first study site is located 29 km northeast of Durango and en- 
compasses approximately 4678 hectares (11,560 acres), including the 
northern third of the Vallecito Reservior. Elevations on this 100 by 
100 pixel block range from 2332 meters at the reservior to 3293 meters in 
the northwest corner of the study area. Ponderosa pine is found predom- 
inantly on the lower hillsides surrounding the lake. As elevation in- 
creases, ponderosa gives way to the Douglas fir/white fir cover type. 



e 3.1 Location of the two study sites used to evaluate the 
Procedure 1 and Multicluster Blocks approaches. 
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At still higher elevations species composition graduates Into the 
Englomann spruce/subalpine fir cover type at approximately 2740 to 2900 
meters (9000 to 9500 feet), depending on the aspect of the site. Quak- 
ing aspen Is ubiquitous, forming mixed or pure stands wherever cutting 
has occured or canopies have opened. Gambel oak may also be found In 
pure, small, shrubby stands at the lower elevations, generally below 
2600 meters on the south facing slopes (Miller and Choates, 1964). 

The second test site, the Devil Mountain Quadrangle, is approximately 
48 km (30 miles) east of Durango and covers 15,156 hectares (37,450 
acres). The Pledra River runs north-south roughly bisecting the quad- 
rangle. The lowest elevation of the quadrangle lies In this river 
channel at the point where the river leaves the quad - 2012 meters (6600 
feet). The highest elevation Is atop Devil Mountain - 3036 meters (9957 
feet). Like the Vail eel to quadrangle, the area Is extremely heterogen- 
eous, with ponderosa pine flanking the lower elevations of the Pledra 
River valley. The ponderosa pine may be dense enough to generate a 
parkllke grasslands understory, but In less dense stands Is often found 
In conjunction with Gambel oak. As elevation Increases or as aspect 
changes from south to north, the pine 1s found In mixture with Douglas 
fir, white fir concomitant. Aspen is found with increasing frequency 
above 2200 meters, and forms mixtures with the Douglas fir, white fir, 
and ponderosa pine. Englemann spruce and subalpine fir are found at 
the higher elevations in conjunction with Douglas fir, white fir, and 
aspen. Aspen forms large, pure stands between 2750 - 3050 meters (9,000 
to 10,000 feet), perhaps as a result of small lightning fires or har- 
vesting activities. Clearcuts are noticeable, especially In ponderosa 
pine stands near the Pledra River. 

Supporting Information 

Selection of the study sites were based not only on the physical 
attributes of the countryside, but also on the availability of ancillary 
information. The San Juan National Forest and surrounding federal lands 
have been the subject of a number of remote sensing research endeavors 
carried out by LARS and INSTAAR personnel. Hence, much detailed infor- 
mation was available which aided in the analysis procedure. 
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Type maps drawn by the Institute for Artie and Alpine Research, 
University of Colorado provided a baseline classification of the two 
study areas. The species type lines have been drawn atop an acetate 
reproduction of 1:24,000 U.S.G.S. minute quadrangle maps. The type 
lines were the result of photointerpretation efforts at INSTAAR using 
color Infrared photographs. Various areas were field checked to Insure 
accurate species Identification. These type maps were laid atop 
1:24,000 llneprinter output to aid In the Identification of clusters 
or to check classification results. 

Also available on the two study sites were 1:120,000 color Infra- 
red photographs taken by NASA In 1973 In conjunction with the Landsat 
1 study. Cluster classes were Identified using these photos, the type 
maps, a Zoom Transfer Scope, and 1:24,000 scale cluster output. 

The Landsat data used In the analysis was obtained on June 5,1973 
at 9:20 am local time. Though some cloud buildup was noted on the 
CIR photos west of the Vallecito study area and north of Devil Mountain 
Quadrangle, no clouds were present on either study area on the Land- 
sat data. 

The Computing System 

The entire analysis was done on LARS' IBM 370/148 medium speed 
computer operated by the Laboratory for Applications of Remote Sensing, 
Purdue University. The central processing unit has a number of peri- 
pheral devices Including tape and disk drives, tape and disk storage, 
remote site terminal access, llneprinters, cardpunches, card readers, 
and a matrix printer. The computer Is located In the FI exlab II buil- 
ding, Purdue Industrial Research Park, West Lafayette, Indiana. How- 
ever remote terminal sites Include (In addition to LARS terminals In 
FI exlab I) Goddard Space Flight Center In Greenbelt, Maryland; Johnson 
Space Center In Houston, Texas; Indiana State University In Terre Haute, 
Indiana; Wallops Flight Center, Wallops Island, Virginia; St. Regis 
Paper Company, Jacksonville, Florida; and Alabama A and M University, 
Huntsville, Alabama. 

Software - The Programming Aspec t 

Two software systems were used In this study. Both are special 
purpose program sets developed to aid appllcatlons-orlented scientists 
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In the analysis of multlspectral data taken by a wide variety of spec- 
tral sensors. The software packages are examined below. 

LARSYS - Purdue's MSS Data Analysis System 
The Idea behind the LARSYS software system germinated In the mid 
1960's. Since that time hundreds of people have helped to develop and 
document this software package “specifically designed for remote sen- 
sing data analysis" (Spencer and Phillips, 1973). This program set 
developed In conjunction with technical support from IBM and grants from 
NASA Includes 18 fully documented processors. 

Five LARSYS processors were used In this study. The first three, 
CLUSTER, MERGESTATISTICS, and SEPARABILITY, were used to develop train- 
ing statistics using a Multicluster Blocks approach. CLASSIFY and 
PRINTRESULTS were responsible for classifying the area and providing 
accuracy estimates. The processors are described below; for more rigor- 
ous explai nations, see Spencer and Phillips, 1973. 

CLUSTER: After the analyst has selected the training areas, each 
must be clustered. Clustering partitions the data Into smaller, sim- 
pler spectral groups. The purpose of clustering Is to form "groups ... 
of data that have elements similar to one another within the cluster 
and different from the elements of the other clusters." (Kan, 1972). 

The LARSYS CLUSTER processor groups the data vectors by first ar- 
bitrarily selecting Initial cluster means and then assigning each pixel 
to that mean closest to it (minimum euclidean distance criterion). Once 
all pixels are assigned, cluster means are recalculated and the assign- 
ment is repeated. The algorithm stops when a prespecified percentage of 
the pixels (CONVERGENCE) are stabilized (i.e., are reassigned to the 
same cluster In two consecutive iterations). 

The user may adjust five parameters which affect the clustering 
results. Other control options may be used, but they control input and 
output material. The following controls are of interest: 

CONV XX. X Convergence specifies the percentage of pixels that must 
maintain cluster allegence between successive iterations in order for 
processing to cease (default 100.0). 

MAXCLAS X This parameter specifies the number of clusters desired 
(default 5). 
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INTV X The Interval card specifies what portion of the points In a 
cluster are used .to calculate the cluster mean. An Interval of 1 In- 
dicates that all pixels In the scene are assigned to the closest clus- 
ter mean and are then used to compute the cluster mean after each Iter- 
ation. Interval 2 means every other pixel Is used to compute the mean. 
Interval 3, every third pixel Is used. Once the convergence parameter 
Is satisfied, all pixels are assigned to a cluster and the final clus- 
ter means calculated (default 1). 

THRESH XX. X The threshold control card specifies a separability quo- 
tient value below which a suggestion Is made to group the clusters. 

The cluster map formed using these and other parameters Is com- 
pared to photographs and the spectral classes. If acceptable, are Iden- 
tified. 

The McB procedure produces as many statistics decks as there are 
training blocks. The classification processor uses a single statis- 
tics deck to classify an area Into various Informational classes. The 
following processor allows the analyst to formulate the final statistics 
deck from the training block decks containing the Identified spectral 
class statistics. 

MERGESTATISTICS: The MERGE processor combines statistics from all 
training blocks to provide the single deck required by the classifier. 

In addition, similar spectral classes (within one or more blocks) may 
be combined, useless or confusion classes may be deleted. The result- 
ing statistics deck may be used to classify test areas and. If results 
prove unsatisfactory, the original block statistics decks may be re- 
merged differently. 

SEPARABILITY: In order to evaluate the consequences of any mani- 

pulation performed using the MERGE processor, the statistical separa- 
bility of the spectral classes are calculated. The separability of the 
classes Is determined using a distance criterion called the transformed 
divergence. As the transformed divergence values Increase, the proba- 
bility of correct classification Increases, assuming the spectral 
classes are correctly labelled. 

CLASS IFYPOINTS: This LARSYS processor Is a Gaussian maximum-likeli- 
hood (ML) classifier which assigns each pixel to an Informational class 
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defined by the analyst In the training statistics deck. Using a pixel's 
spectral values obtained from the data tape, the processor calculates 
the probability that the data vector belongs to each of the spectral- 
informational classes. CLASSIFYPOINTS assigns the pixel to the class 
with the highest probability. The classifier assumes the spectral 
classes used In the training statistics deck are unimodal and exhibit a 
normal distribution. The classification results, written to disk or 
tape, can be evaluated using PRINTRESULTS. 

PRINTRESULTS: The final step In any classification procedure In- 
volves output of the final classification products and an evaluation of 
the results. If deslrrj, an alphanumeric 1:24,000 scale llneprinter map 
may be output. Of particular value are classification performance 
tables concerning training and/or test fields delineated by the analyst. 
Acreages of each Informational class may be tabulated using another pro- 
cessor (fiLPRINT) If the total acreage of the scene Is known. 

The experimental methods which utilize the programs outlined above 
are explained In the Procedures section. 

EODLARSYS - Johnson Space Center's Software 

In 1970, LARSYS Version 2 was Implemented on a UNIVAC 1108 EXEC 2 
System as a batch machine at NASA/ Johnson Space Flight Center In Houston. 
Since 1970, personnel at the Earth Observations Division have Indepen- 
dently modified the original LARSYS package Into a new, nonetheless 
relate, software system. Similarities are noted between a number of 
processors In LARSYS and EODLARYSS, though some EODLARSYS processors 
have been created to perform new functions. 

The five EODLARSYS processors of Interest to this study were de- 
veloped In an attempt to more fully automate the classification pro- 
cess. The first three, DOTDATA, ISOCLS, and LABEL are responsible for 
developing a statistics file similar to the LARSYS statistics deck. 

The last two, CLASSIFY and DISPLAY, use the training statistics to clas- 
sify the area and produce performance tables and 1:24,000 scale line- 
printer maps. 

These five processors are used In Procedure 1, a method designed 
to classify agricultural areas using Information collected on randomly 
selected pixels (dots) In the scene. Point Information Is currently 



available on some forestlands In the form of Continuous Forest Inven- 
tory (CFI) or sample points or plots which are locatable In the Landsat 
scene. The Procedure 1 processors can make direct use of this Infor- 
mation to develop labelled training statlscics which may be used to 
classify an area. 

The five EODLARSYS processors used In Procedure 1 are described 
below. 

DOTDATA: The DOTDATA processor Is (In this study) the first program 
run. It allows the analyst to Input supervised training points whose 
Identities are known. The computer retrieves the spectral reflectance 
values for each pixel Input to DOTDATA* and compiles a numbered listing 
of the Information. Each data vector (pixel) now has Its own unique 
number which can be used to refer to the pixel In processors following 
It. 

The dots defined to the DOTDATA processor may be divided Into Type 
1 and Type 2 dots. Type 1 dots may be used as cluster center starting 
dots In ISOCLS, and are used by the LABEL processor to Identify the 
clusters formed In ISOCLS. Type 2 dots are used for bias correction, a 
classification accuracy calculation explained In Reeves (1978)* or Wills, 
Gardner* and Aucoln (1977). Only Type 1 dots were used In this study. 

ISOCLS: This versatile clustering function accepts as Initial clus- 
ter centers 

1. dots (pixels) from DOTDATA* 

2. multlspectral starting values Input by the analyst* or 

3. nothing - It may self-start. 

The clustering processor outputs unlabelled (unidentified) spectral 
classes (cluster means and covariances) for the LABEL processor. 

ISOCLS has 9 parameters which affect the number of clusters output: 

NMIN SEP STOMAX 

PMIN ISTOP CLUSTER 

DLMIN PERCENT SEQUEN 

Four of i^hese were Investigated In the parameter study* and these four 

are described below. 

SEQUEN XXX: ISOCLS manipulates groups of pixels by first assigning all 

pixels to particular groups* then splitting or combining those groups 
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to form new groups (or clusters). The analyst controls the order of 
splitting and combining by using the SEQUEN control card (default SC« 
Split, Combine). 

ISTOP X; The maximum number of Iterations performed In the 1n1t1_al 
split sequence (default 10). For example: 

ISTOp'^ 10^ splits and one combine Iterations. 

iItOP*^ 10^^ splits and one combine. 

ISTOP^ Q^^^) would yield 8 splits, a combine, a split, and a combine. 

Whether or not a particular cluster Is split or combined on a S or 
C Iteration depends on the values of the following parameters. 

STONAX X.X: Any cluster with a standard deviation greater than X.X In 
any channel Is split on a split Iteration (default 4.5). 

DLMIN X.X: Any clusters whose means are closer than X.X units are com* 
blned on a combine Iteration (default 3.2). 

Parameter studies by Moritz, Pore, Yao (1978), and Pore, Moritz, 
Register, and Yao (1978) using the ISOCLS processor have found that 
classifications of LACIE segments were more accurate when ISOCLS was 
not allowed to Iterate. Hence, for agricultural land classification, 
the following parameter levels have been siggested: STDMAX 15, ISTOP 1, 
DLMIN 0 CLACIE parameters). The clusters obtained from a three pass 
grouping^ are very much dependent on the Initial cluster centers (Input 
from DOTOATA). In the studies cited above, the Initial centers were 
chosen randomly from a dot file of a LACIE segment. The approach may 
p.'uve disasterous In a forest scene because: 

1. The means and variances of the spectral classes which char- 
acterize the area are dependent on the dots used to seed the 
processor. 

2. The clustering function is not allowed to Iterate. Hence ad- 
justments In the cluster means which would ordinarily reduce 


1. The ISOCLS parameters given above are those used In the LACIE study. 
ISOCLS Is a three pass processor In this Instance, with the second 
and third passes deleting any clusters which are too small (number 
of pixels). A one pass grouping function could be Instituted If 
ISTOP Is set equal to 0. 
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the sum of squares error (SSE) are not made. 

However* the three pass clustering algorithm (LACIE parameters) merits 
Investigation. 

LABEL: Unlabelled cluster statistics from ISOCLS are compared to the 
Type 1 dots compiled In OOTOATA. The analyst may choose one of two 
procedures to Identify the clusters: 


1. K-Nearest Neighbor: This alternative finds the given number 
(K) Of type 1 dots closest to the cluster mean* and labels 
that cluster using the dot Identifications. Thus* If the 
analyst selected 5-Nearest Neighbor procedure* and the major- 
ity of the five closest dots were grass* the cluster would 

be labelled grass. 'Closeness' Is determined by geometric 
(Li and L2) distances In n space* where n equals ^he number, 
of channels used for clustering and dot labelling purposes. 

In the event of a tie, K minus one dots are considered* the 
dot farthest distant fru>'' the cluster mean Is dropped from 
consideration. 

2. A11-of-a-K1nd: Another method used to label clusters In- 
volves checking the Identification of all Type 1 dots within 
the cluster. If alj^ of the Identified pixels In that clus- 
ter belong to the same category* then the cluster Is labelled 
accordingly. If all the Type 1 dots within that cluster do 
not have the same Identification* then the labelling proced- 
ure defaults to K-Nearest Neighbor. 

Pore et a1. (1978) found that results were consistently more ac- 
curate when AII-of-a-KInd (Instead of K-Nearest Neighbor) was used to 
label the agricultural statistics. Using 4* 8* 12* and 16 channels of 
multlspectral Information* accuracy Increases ranged from 1.17 to 9.68 
percent. In the course fo this study* I attempted to use the All-of-a- 
Klnd labelling procedure* but the processor kept defaulting to K-Nearest 
Neighbor since clusters formed In a forested scene commonly hold Type 1 
dots In more than one category. Therefore* the K-Nearest Neighbor 


4 


1 



1. In other words, If using LI distance to determine distance from a 
dot to a cluster mean - 4 channels of data* where: 

Of " multlspectral value of that pixel In channel 1, 1 ■ 1 to 4. 

C4 ■ multlspectral value of the cluster mean In channel 1, 1 > 1 
' to 4. 

LI distance ■ * ®4*^4| 

L2 distance -^/(Of-Cf)^ + (Og-C^)^ ♦ 
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labelling procedure was used throughout the course of this study to In* 
sure labeling consistency. 

CLASSIFY: One of two procedures may be used to assign a pixel to one 
of the group* defined by the training statistics deck. The first pro- 
cedure Involves a standard* maximum-likelihood classifier. The proba- 
bility that a given pixel belongs to a given subclass Is calculated for* 
each subclass. The pixel Is assigned to the subclass which exhibits the 
highest probability. 

The catej^rles classifier (the second of the two procedures) com- 
putes a probability density function for each category defined for the 
statistics deck. A category Is defined as a group of Informationally 
related spectral subclasses. In other words* the Hardwood Informational 
class may be represented by a number of spectral subclasses. That group 
of spectral subclasses Is considered a category. The processor classi- 
fies a pixel Into the category exhibiting the highest probability. The 
pixel Is then classified Into a subclass within the category using maxi- 
mum-likelihood. The categories classifier Is also called the Sum-of- 
Normal -Densities (SoNO) classifier. The SoNO classifier Is Invoked (In 
this study) using the CATEGORY FILE card. This card tells the processor 
that the classnames In the labelled statistics deck are to be used as 
categories. So* If the Hardwood class contains three subclasses (three 
clusters)* the processor sums the density function for the three sub- 
classes to produce the density function for the category Hardwood (see 
Figure 3.2). These category density functions are used to classify a 
given pixel Into one of the categories. Once a pixel Is assigned to a 
category* the probability that It belongs to each one o* the subclasses 
within that category Is computed* and the pixel Is assigned to that sub- 
class with the highest probability. 

DISPLAY: Procedure 1 accuracy statements may be calculated using 
Type 1 dot classification accuracies or using test field accuracies. If 
the former Is chosen* the processor evaluates and adjusts classification 
accuracy using Type 1 and 2 dots. 

Test fields or points may be Input* In which case Type 1 and 2 
dot accuracy calculations are not determined. Throughout the course of 
the classification procedure* the analyst has the ability to outline 
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areas that are of no Interest (Designated Other) or areas that cannot 
be Identified) such as those areas covered by clouds (Designated Uniden- 
tifiable). Test fields may only be used If DO (Designated Other) or 
DU (Designated Unidentifiable) fields are specified. DO and DU 
areas were not used In this study thus allowing use of test fields to 
provide the accuracy considerations. 

Of the 13 processors currently online In the EOOLARSYS software 
system, only the five described above were used In this study. The 
descriptions given for each of these five processors are not complete; 
many options have not been discussed since they were not directly In- 
volved In the study. Stewart and Aucoln (1978) supply the most con- 
cise documentation on these processors as Implemented on the LARS IBM 
370/148. Additional Information may be found In user documentation 
written by Minter, Wills, and Gardner, 1977, and Wills, Gardner, and 
Aucoln, 1977. 


!P 
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CHAPTER 4 - PROCEDURES 


Introduction 

In order to meet the major objective and sub-objectives outlined In 
the Introduction, the study was broken Into two phases. The first phase 
consisted of the ISOCLS (EOOLARSYS clustering function) parameter study, 
since little was known of the effects of the various parameters on the 
statistics output by ISOCLS. The second phase used Information gathered 
In the first phase to establish processor parameters so that a valid com- 
parison of McB and P-1 could be made. These phases are described In 
subsequent sections. 

Procedures of Interest - 

the Multi cl ustir Blocks Approach an? Procedure 1 
Before any experimental procedure Is described, the reader should 
understand the methodologies Involved In the two approaches being com- 
pared. The processors used In each of the approaches (McB and P-1) are 
described In the Materials section. The use of these processors to pro- 
duce a classification Is reviewed below. 

The Multi cluster Blocks Approach 
The Multicluster Blocks procedure Involves selecting relatively 
small, heterogeneous blocks (1600 to 3600 pixels) to train the computer. 
Four primary considerations In selecting the location of the training 
blocks are defined by Fleming and Hoffer, 1977. The blocks should 
have 

1. "a representative sample of each Informational class 

2. "three to five cover types 

3. "a precisely locatable feature and 

4. available photography. 

Each block Is Initially clustered Into approximately 14 to 16 spectral 
classes, though more or less may be desired depending on the spectral 
heterogeneity of the block. A Zoom Transfer Scope and the appropriate 
photography are used to Identify the spectral classes. 
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The spectral classes output from each of the blocks must be pooled 
to combine the training statistics into a single deck, Fleming and 
Hoffer (1977) found that a three Iteration, moderately modified pooling 
procedure yielded the most accurate results.^ The first iteration pools 
those spectral classes with a divergence value of 500 or less. The 
second and third iterations pool classes with divergence values less than 
or equal to 1000 and 1500 respectively. 

The training statistics formed are used to classify a small portion 
of the study area, perhaps one of the training blocks. If the results 
are not satisfactory, and the original clusters are acceptable, then 
only the pooling need be redone, an inexpensive (though analyst inten- 
sive) procedure. Once acceptable training statistics are formulated, 
the entire area of interest is classified. 

The Procedure 1 Approach 

Procedure 1 uses the five processors described in the Materials 
section to classify the area of interest. The analyst has control over 
the processors' parameter levels and must input the locations of pixels 
of known identity. If the pixel identification information is available 
from some existing source, such as forest inventory data, analyst in- 


volvement is markedly reduced. 

The dot information is compiled and may be used to seed the clus- 
tering processor. Regardless of the clustering method, the dots, whose 
identity and spectral reflectance values are known, are used to iden- 
tify the spectral classes output by the clustering processor. The 


1. An analyst may accept the judgement of the SEPARABILITY processor 
and pool those classes suggested by the processor (unsupervised 
pooling). Alternatively, the analyst may use his own discretion 
when pooling, basing his decisions on the identity of the spectral 
classes, the bispectral plot, and the divergence values between 
the classes (modified pooling). 

2. Transformed divergence measures spectral class separability. A 
value of 0 means the two classes are identical, a value of 200 
(the maximum) means the classes are extremely dissimilar, very 
separable. 


ri':s>roducibitjty of the 

Tf*' 


advantage of such a system rests with the fact that the labelled statls 
tics are free from any biases^ except those Inherent In the Initial 
Identification of the dots. The disadvantage stems from the fact that, 
unless the analyst Intervenes, each spectral class produced Is labelled 
and used In the final training statistics. Border spectral classes 
(I.e. hardwood-conifer mixes or grass-hardwood mixes) must be labelled 
either hardwood, conifer, or grass. Hence the labelling process may In 
some Instances promote misclassificatlon. 

The labelled statistics are used to train the classification 
processor. Accuracy of classification may be assessed by comparing 
the classification's Identity of the Input dots to their original 
Identity or by using test fields. 

ISOCLS Parameter Study 

The objective of this portion of the study was to develop a set 
of ISOCLS parameters which could be used to cluster a forested area 
accurately and economically. The 100 x 100 pixel Vallecito Reservoir 
area was chosen for this portion of the Investigation. The criteria 
used to judge cluster performance were 

1. number of clusters produced. 

2. CPU time used, and 

3. classification performance. 

The third criterion unfortunately Is Influenced by how well the LABEL 
processor does Its job. LABEL'S performance In turn is Influenced by 
the ability of the analyst to correctly identify the dots used In the 
dotflle. Finally, classification performance Is Influenced by how well 
test fields were selected and Identified. In spite of the subjectivity 
of this performance criterion, accuracy of classification was used In 
conjunction with the other two criteria. 

Type 1 dots and test fields were delineated on the study site. 

The dots were used to seed the ISOCLS processor and were also used by 
LABEL to Identify the clusters formed In ISOCLS. These dots were 
located using a Systematic sample grid system that Is used by the USFS 
Northcentral Forest Experiment Station to Inventory their forests In 


1. Biases may be Introduced by analyst Identification and manipulation 
of the training statistics. 
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Minnesota. The grid covers a township sized area and Is composed of 
121 dots, arranged In 11 rows of 11 dots. Each row Is canted five 
degrees above the horizontal to avoid having entire rows or columns fall 
on a N-S or E-W road (see Figure 4.1). A 1:24,000 township sized grid 
was drawn to locate the Type 1 dots on a llneprinter cluster map of the 
study area. Sixty dots were located on the Vail eel to study area using 
the grid; these dots simulate the Information which might be obtained 
from Forest Service records. An additional 23 dots were located In 
cover type categories that had not been sufficiently represented by the 
systematic sample. These 83 points were Identified using the Zoom 
Transfer Scope, a 1:24,000 llneprinter map of the Vallecito area, and 
1:120,000 color Infrared photos. Table 4.1 shows the composition of the 
Type 1 dots used. 

Table 4.1 Type 1 dots used in the ISOCLS parameter study (Vallecito 
study area), by class. 

Located 


Classname 

Systematically 

by Cover Type 

Total 

Hardwood 

17 

0 

17 

Conifer 

33 

0 

33 

Grass 

6 

8 

14 

Barren 

0 

9 

9 

Water 

_4 

_6 

10 

Total 

60 

23 

83 


Manually selected test fields were also delineated on the 100 by 
100 pixel area. The accuracy of classification of these test fields was 
one criterion used to judge how well a particular set of parameters 
performed in classifying the area. 

The same dotflle (the same 83 points) and the same test fields 
were used throughout the parameter study. 

The parameter study was broken Into two parts. The first portion 
of the parameter study Investigated the effects of the following ISOCLS 
parameters: PERCENT, STDMAX, DLMIN, ISTOP. Initially each of the para- 
meters was studied by holding all other cluster parameters at their 
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default values and varying the value of the parameter In question. On 
the basis of the responses of the ISOCLS processor to various parameter 
manipulations, parameter combinations were choosen for evaluation. 

The second part of the parameter study Investigated the effects of 
self-starting and seeding ISOCLS with a random sample of Type 1 dots. 

From previous experience. It Is known that: 

1. The LACIE study effectively used a two pass clustering 
algorithm. 

2. With all ISOCLS parameters set to LACIE specifications, the 
number of seed dots Input equals the number of clusters out- 
put + 2 clusters.' 

3. These clusters often retain the same Identity as the starting 
seed when Identified by LABEL. 

Hence the parameter study compared the effects of self-starting, LACIE's 
two pass approach, and a seeded, iterative approach. 

The parameter study makes no pretenses at completeness, rather takes 
the attitude that: 

1. a little knowledge might greatly Improve classification 
accuracies and save computer time. 

2. other parameter studies might build on the Information 
compiled here. 

The results of this work are presented In the Results section of this 
report. 

Classification Study 

Establish Training and Testing Blocks, Fields, and Points 
The overall objective of this part of the study was to compare the 
best methods found to date for classifying forested areas using two 
different training techniques, the Procedure 1 processors (OOTDATA, 
ISOCLS, and LABEL) and the Multicluster Blocks approach. 

Three Items were necessary to proceed with the comparisons: Type 1 
dots, training blocks, and test fields. 

The Type 1 dots were selected using the 121 point township grid laid 
atop an unsupervised classification (16 classes) of the Devil Mountain 
Quad. These dots were Identified using aerial photography and the Zoom 

1. The number of clusters produced will always be less than or equal to 
the number of Type 1 dots Input. Generally as the number of seeds 
rises, variability Increases. 


r 
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Transfer Scope. Additional Type 1 dots were chosen In a cover type only 
If that type was not sufficently represented by points selected using 
the grid. In other words. If each of the five cover types did not con- 
tain at least ten Type 1 dots,^ suppllmentary dots were arbitrarily lo- 
cated In the Inadequately represented cover type(s). Sufficnet dot repre- 
sentation In each cover type Is mandltory due to the nature of Procedure 
1's cluster Identification. If, for Instance, the K-Nearest Neighbor 
technique Is used to label the clusters, with K set equal to 5, and only 

two dots have been Identified as barren In the dotflle, most likely no 

2 

cluster will be labelled barren. Since the barren class would not be 
represented In the labelled training statlsitcs, all barren areas In 
the data would be automatically misclassifled. The decrease In classi- 
fication accuracy would not be a function of ISOCLS or LABEL as much as 
It would be the fault of the method used to select the Type 1 dots. In 
order to overcome this problem, each of the five major cover types had 
at least ten Type 1 dots. 

Three training blocks were established on the Devil Mountain Quad- 
rangle. The blocks, ranging In size from 841 pixels to 1804 pixies, were 
established In heterogeneous areas according to the criteria listed on 
page 37of this report. 

Manually selected test fields were located in homogeneous areas again 
using the same materials that were used to select the training blocks. 

The same test fields were used to evaluate each of the classification 
methods tested (see Table 4.2). The test fields contained no Type 1 
dots and were not located In any of the training blocks. 

The study site contained 33,123 pixels (183 lines x 181 columns). 
ISOCLS. using four channels of data data, can accommodate 91,728 pixels. 
Procedure 1 demands that the entire area be clustered In order to 

1. The designation of 10 Type 1 dots as the allowable minimum number of 
dots Is arbitrary and peculiar to this study area. This analyst 
cited ten as a minimum because no more than that could have been 
reliably located In the barren class. 

2. It Is actually possible for a cluster to be labelled barren under 
these circumstances, but the chances of correctly Identifying the 
barren class are small. 
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develop training statistics. In order to reduce CPU time and remain 
within the dimensional restrictions of CLUSTER, the quadrangle was clus* 
tered on an Interval of two.^ 


Table 4.2 

Test fields used on the Devil Mountain Quadrangle 

Cover 

Test Fields 

Percent 

Test Field Pixels 
Percent 

Type 

Number 

of Total 

Number 

of To'tal 

Conifer 

39 

41.5 

932 

55.3 

Hardwood 

33 

35.1 

457 

27.1 

Grass 

16 

17.0 

252 

14.9 

Barren 

_6 

6.4 

46 

?.7 


94 

100.0 

1687 

100.0 


Classification Procedures 

A classification processor uses training statistics which enable 
the classifier to group pixels Into the Informational classes defined by 
the analyst. In this study, six different runs were made employing 
various processor combinations and different parameter settings. Four 
of the runs used the Procedure 1 approach to developing training statis- 
tics (I.e., form a dotflle, cluster the entire area, use the dots to 
label the clusters); two used the NcB approach (I.e., cluster each 
training block separately. Identify and merge the spectral classes). 
Remember that the P-1 approach does not necessarily mean that only 
EODLARSYS processors were used; clustering processors were Interchanged 
In some of the runs. Table 4.3 characterizes the -six runs made. Each 
of the six runs were cUssIfled using both 

a. the EODLARSYS Sum-of-Normal -Densities classifier (CLASSIFY 
using CATEGORY FILE control card); and 

b. the LARSYS standard maximum likelihood classifier 
(CLASSIFYPOINTS). 

Thus, a total of twelve classifications were generated and compared. 

1. Using 4-channe1 Landsat data, CLUSTER can handle approximately 10,000 
pixels (If N s number of pixels processed, then N < 40,000/n < 25,000, 
where n » number of channels, see Spencer and PhiTllps, 1973T. 




Table 4.3 Development of training statistics using the Nulticluster 
Block and Procedure 1 approaches (clustering processors Inter- 
changeable). 


Method of 
Developing 
Training Statistics 



Procedure 1 

Nulticluster Block^ 


Cluster Processor Used 



ISOCLS 


CLUSTER* 

Unseeded 

Seeded 



10 Iterations^ 

3 Iterations 1 

Iteration 

(LACIE) 


Run 1 
Run 3 

Run 6 

Run 5 

Run 2 
Run 4 


The purpose of making the six runs, and using both classifiers on 
each run was fourfold: 

1. The results of all the runs allow us to draw conclusions on 
which Is the better method of developing training statistics. 

2. By comparing Run 1 vs Run 2, and Run 3 vs Run 4. we may draw 
conclusions on which of the clustering functions - ISOCLS or 
CLUSTER - Is more efficient. 

3. The results of Run 1 vs Run 5 vs Run 6 allow us to judge the 
effects of seeding on CPU time used and accuracy of classifi- 
cation. 

4. Within any given run. the Sum-of-Normal -Densities classifier 
and the standard maximum-likelihood classifier may be com- 
pared. 

Each run Is described In detail below. For a given run. the training 
statistics processors remain constant; the letter following the number 
Indicates which classifier was used (a. the EODLARSYS Sum-of-Normal - 
Densities classifier, b. the LARSYS maximum likelihood classifier). 

5 

Note that the letter 'R* Indicates a statistics deck reformatting step. 


1. The number of Iterations refers to the number of spilt Iterations. 

2. Processors used to develop training statistics (cluster function varl- 

3. Processors used to develop training statistics (cluster function varl- 


4. Cannot be seeded, and Is Iterative. 

5. Two programs written by Carol Jobusch. LARS statistician, were used 
to make the LARSYS-EODLARSYS statistics deck conversions. 
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Run 1 : Order of Processors : 

nnTMTA-tcnric lABci /«• CLASSIFY-DISPLAY (SoNO) 
DOTOATA-ISOCLS-LABEL-(^^ R-CLASSIFYPOINTS-PRINTRESULTS (ML) 

The first run was a modified version of Procedure 1. All pro- 
cessors used to develop the training statistics were those used In 
Procedure 1, however ISOCLS was not seeded. The entire quad was clus- 
tered In order to produce the training statistics. Clustering was done 
on an Interval of two, and approximately 20 to 25 spectral classes were 
desired. Since the analyst rarely knows exactly how many spectral 
classes will be produced by ISOCLS for a given set of parameters, runs 
using ISOCLS (Runs 1 and 3) were done first. The number of spectral 
classes produced by ISOCLS equaled the number of classes requested of 
CLUSTER In Run 2 (to aid with clustering processor comparisons). 

Run 2; Order of Processors: 


OOTDATA-CLUSTER-R-LABEL- ( J* 

b. 


CLASSIFY-DISPLAY (SoND) 
R-CLASSIFYPOINTS-PRINTRESULTS (ML) 


This run used LABEL to Identify the spectral classes formed by the 
LARSYS clustering function. The unlabelled statistics produced by 
CLUSTER (MAXCLAS « 20, INTV » 1, CONV • 98.5) were reformatted for use by 
LABEL, CLASSIFY, and DISPLAY. The labelled statistics output by LABEL 
were reformatted prior to processing by LARSYS, CLASSIFYPOINTS and 
PRINTRESULTS (Run 2b). 


The same dotflle was used by the LABEL processor In Runs 1,2, 5, 
and 6. Runs 1 and 2 used the dotflle only for labelling the statistics 
output by ISOCLS and CLUSTER respectively. Runs 5 and 6 used the dot- 
file to seed ISOCLS and to label the clusters produced. 

Run 3: Order of Processors: 

ISOCLS-R-HFRGESTATISTirs-^FPARABTI ITV-f®* R“CLASS I FY- DISPLAY (SoND) 

IbUCLb K ntRQESTATlSUCS SEPARABILITY CLASSIFYPOINTS-PRINTRESULTS (ML) 

The third run used a Multicluster Block approach to developing 
training statistics on the Devil Mountain Quad, but used the EODLARSYS 
clustering function ISOCLS to develop the stat deck. Again the ISOCLS 
function was used first since the analyst did not have complete control 
over the number of clusters output. 

Each training block selected was clustered Individually; 14-16 
spectral classes were formed In a block. These spectral classes were 
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Identified by the analyst using a 1:24,000 llneprinter map of each train- 
ing block, aerial photography, and a Zoom Transfer Scope. All spectral 
classes formed were manipulated using the LARSYS MERGESTATISTICS end 
SEPARABILITY processors to produce the final training statistics dock. 

This stat deck was used to classify the area. 

Run 4: Order of Processors: 

... ..erpB ura«>erTA<rrcTTfe ert>ABADti tTtf /«• R-CLASSIFY-DISPLAY (SoNO) 
CLUSTER-MER6ESTATISTICS-SEPARABILlTY-(j,^ CLASSIFYPOINTS-PRINTRESULTS (ML) 

Run 4 was a standard Multicluster Block approach to developing 
training statistics, utilizing LARSYS processors throughout. The train- 
ing blocks used to develop the training statistics In Run 3 were used In 
this run, and the same number of spectral classes requested for each 
block as were produced In Run 3. In other words. If ISOCLS clustered the 
first and second training blocks (Run 3) Into 14 and 16 spectral classes 
respectively, then CLUSTER'S MAXCLAS parameter was set at 14 and 16 for 
these two blocks In Run 4 (CONY ■ 98.5, INTV ■ 1). 

Runs 1 and 2 (using EODLARSYS software, different clustering pro- 
cessors) had equal numbers of spectral classes. Runs 3 and 4 had equal 
numbers of spectral classes (or very nearly so). However, the number of 
spectral classes formed In Runs 1 and 2 did not necessarily equal or even 
approximate the number formed In Runs 3 and 4. This should not be of any 
great surprise since one of the advantages of the Multicluster Block 
method of developing training statistics Is Its ability to produce large 
numbers of spectral classes with a relatively small amount of CPU time. 

Run 5: Order of Processors: Same as Run 1. 

The fifth run used a LACIE classification approach to developing 
the training statistics for the two classifiers. Unlike the previous 
runs, ISOCLS was seeded, and was allowed to Iterate only twice (one 
split, one combine). Twenty-three dotflle seeds were used In an attempt 
to produce 20 (or more) clusters. Various numbers of each type of dot 
(Hardwood, Conifer, etc.) were Input according to 

1. approximate photo interpreted areal percentage of cover types, 
based on a quick scan of the 1:120,000 CIR photos, 

2. number of cover type dots In the Type 1 dot data set (I.e., 
percent of area covered by hardwoods is proportional to the 
number of hardwood dots/total number of dots), and 

3. expected variability within a class (Intuitive). 
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The spectral classes were labelled using LI distance and l~Nearest 
Neighbor. 

Run 6: Order of Processors: Same as Run 1. 

This run was most similar to Run S In that ISOCLS was seeded with 
Type 1 dots. The only difference between Runs 5 and 6 was that Run 6 
allowed ISOCLS to Iterate (3 split Iterations) and Run 6 used I? distance 
and 10-Nearest Neighbor to label the spectral classes (all runs except 5 
used this labelling procedure). 

The results of these classifications were analyzed using a Newman- 
Keuls Range Test; the procedure Involved steps outlined on page 2.7>11 
of Landgrebe (1976). The Maximum-Likelihood and Sum-of-Normal -Densities 
classifiers were compared using palred-t tests as outlined in 
Mendenhall (1975). The results of these classifications, and conclusions 
drawn from this Information are given In the subsequent sections. 
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CHAPTER 5 - RESULTS 
Parameter Study 

More than 50 separate classifications were done en route to com- 
pleting this portion of the study, although more work would have to be 
done In order to fully quantify the effects of the various parameters. 

The results Indicate trends and may provide useful rules of thumb. The 
reader should realize that replicates were not run. The results shown 
below are most certainly Influenced by characteristics of the data. The 
shape of the curves may be quite different depending on the dots used to 
seed ISOCLS, the values of the other clustering parameters, and the spec- 
tral characteristics of the study area. 

ISOCLS Parameter Study 

The Effects of Four Parameters: STDMAX, PERC, ISTOP, DLMIN 

The EODLARSYS clustering processor Is a complex program which al- 
lows the analyst to control the statistical 'shape' of the clusters 
formed by controlling standard deviations In each channel. Although the 
program has no single parameter which regulates the number of clusters 
resulting, the number may be managed by changing the values of certain 
ISOCLS parameters. The purpose of this study was to quantitatively de- 
fine the effects of STDMAX, PERCENT, ISTOP, and DLMIN on 

1. the number of clusters formed, 

2. the computer time used, and 

3. in some instances, the accuracy of classification. 

The effects of a given parameter on the performance of the clustering 
processor is often dependent on the values of the other parameters. For 
instance, changing PERCENT from 80 to 100 may have no effect when all 
other parameters are set at their default values, but the same change 
may significantly effect the number of clusters formed (and CPU time 
used) when the values of STDMAX and DLMIN are reduced. Other parameters 
are listed only when their values were not the default. Following is a 
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list of the default values for the parameters mentioned In this study. 

STDMAX 4.5 PERCENT 80 I STOP 10 

SEQUEN SC DLMIN 3.2 Number of Seed Dots 0 

SEP separates new cluster means by plus-minus one 

standard deviation In the channel (s) which do not 
meet the STDMAX criterion. 

In addition to Investiga ;1r. ; the effects of various parameters, the 
effect of seeding was considered. In the course of the parameter study, 
ISOCLS was allowed to self-start, and In other runs was seeded with a 
subset of the Type 1 dots (from the dotflle). The results of the para- 
meter and seeding study are given below. 

STDMAX: ISOCLS splits any cluster which exhibits standard devla- 

t1on(s) (within a channel) greater than the STDMAX value. Hence as 
STDMAX decreases, more clusters are split, and CPU time Increases.^ 

The results of clustering proved to be quite sensitive to changes In the 
STDMAX parameter. STDMAX had the greatest (and most predictable) effect 
on the number of clusters formed (see Figure 5.1). 

PERCENT: ISOCLS ceases processing 

1. when all split and combine sequences are exhausted (as spec- 
ified by SEQUEN), or 

2. when the percentage of stabilized clusters exceeds the 
PERCENT number. 

A stabilized cluster Is one whose standard deviations In all channels 
considered are less than STDMAX. Hence If at the beginning of an Iter- 
ation 20 clusters have been formed, and either 18, 19, or 20 are stable, 
then processing will cease if PERCENT is less than or equo^ to 90. 

The PERCENT parameter offers only gross control over C^U time and 
the number of clusters formed. With all other parameters at default, 
there was little difference between clusters formed at PERC=80 and PERC= 
90. Values of 90 and 100 produced Identical output. 

Using a different set of parameters and seeding ISOCLS with seven 
Type 1 dots produced the results seen In Figure 5.2. Though at best 
preliminary, these results Indicate PERC=90 may save CPU time without 

1. CPU times given In this parameter study section refer to the amount 
of computer time needed to cluster a 100 x 100 pixel forested area 
(Vailed to Study Area). 



Figure 5.1 Effects of STOMAX on ISOCLS performance. Empirical rela- 
tionship between STDMAX, number of clusters formed, and CPU time 
used (Vallecito study area). 



Figure 5.2 Effects of PERCENT on ISOCLS performance. Empirical rela- 
tionship between PERCENT, number of clusters formed, and CPU time 
used (Vallecito study area). 


CPU Time - ISOCLS (min) CPU Time - ISOCLS (min) 
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a large effect on the number of clusters formed. 

ISTOP: The analyst may designate the order of split and combine op- 
erations. ISTOP Informs the processor how many Iterations are done In 
the Initial split sequence (Indicated by the first 'S' on the SEQUEN 
control card). 

As a preliminary step, ISTOP was changed from 5 to 10, 15, and 20 
with all other parameters set at their default values (exception, 

SEP = 1.0). Evidently only 5 (or less) split Iterations were necessary 
to stabilize 80% (default value for PERCENT) of the clusters formed 
because there was essentially no change In CPU time used, and no change 
In the number of clusters produced. 

As DLMIN Is reduced, one would expect 

1. the number of clusters formed to Increase, and 

2. average Intercluster distance to decrease. 

The results Indicate that DLMIN values below the default do not affect 
the number of clusters produced and do not significantly affect the CPU 
time used. Hence the DLMIN default value was used In the subsequent 
classification study on the Devil Mountain quad. The effect of DLMIN may 
become more noticeable when the number of initial split Iterations Is 
reduced. DLMIN Is a divergence value. The average divergence value for 
all clusters formed in an iteration should increase as the number of 
iterations increase. Hence as ISTOP is increased, the effects of DLMIN 
should be reduced (i.e., fewer clusters should be combined for a given 
DLMIN value). 

In summary, as a result of this parameter study. Table 5.1 shows 
the parameter values defined for use with the ISOCLS processor in the 
classification study. 

The ISOCLS parameters developed on the Vallecito Study area pro- 
duced less than the desired number of spectral classes on the Devil 
Mountain quadrangle. Parameters were adjusted (specifically, STDMAX and 
DLMIN were decreased) to produce 20-25 spectral classes (Run 1) on the 
entire quadrangle and 14-16 spectral classes on each training block 
(Run 3). The final parameters used are noted in subsequent sections. 
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Figure 5.3 Effects of ISTOP on ISOCLS performance. Empirical rela- 
tionship between ISTOP, number of dusters formed, and CPU time 
used (Vallecito study area). 
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ISOCLS Parameters Used; 


- STDMAX 3.0 

- PERC 90 

- 7 Seed Dots 


- All parameters 

default H 


(76.5) (76.5) 



(60.0) 


Note; Classification Performance (%) in parentheses. 
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Figure 5.4 Effects of DLMIN on ISOCLS performance. Empirical rela- 
tionship between DLMIN and number of clusters formed using two 
ISOCLS parameter sets (Vallecito study area). 


54 


Table 5.1 Initial Devil Mountain classification study parameters for 
ISOCLS, based on the results of the Vail eel to ISOCLS parameter 
study. 


Parameter 

Run 1 

Run 3 

Run 5 

Run 6 

STDMAX 

2.6 

3.25 

15.0 

2.8 

PERCENT 

90 

90 

80 

90 

ISTOP 

10 

10 

1 

3 

DLMIN 

3.2 

3.2 

0 

3.2 

Dots Used 
(seeds) 

0 

0 

23 

23 

SEP 

default 

default 

1.0 

default 

CLUS 

25 

16 

60 

25 


The Effects of Seeding ISOCLS 

The second phase of the parameter study Involved an Investigation 
Into the effects of seeding ISOCLS. Three methods were compared 

1. Unseeded - 10 split Iterations, 

2. Seeded - 1 split Iteration, and 

3. Seeded - 10 split iterations. 

Unseeded - 10 Split Iterations: The ISOCLS processor clustered the 
study area four times. Each time STDMAX was decreased to study the ef- 
fects of the STDMAX parameter (see STDMAX section, pg 50). The number of 
clusters output were 7, 14, 22, and 32, accomplished by reducing STDMAX 
from 4.5 to 2.5. The following ISOCLS parameters were used: PERCENT 100, 
STDMAX variable, SEP 1.0, all other default. 

ISOCLS Is little more than a one-pass grouping processor when the 
parameter levels are set to LACIE specifications. The processor actually 
goes through the data three times. The first pass assigns each pixel to 
a seeded cluster mean using a minimum-distance criterion. The last two 
passes merely delete clusters which are too small (I.e., clusters which 
contain less than NMIN and PMIN pixels). No splitting or combining Is 
done since the STDMAX parameter Is very large (15.0) and the DLMIN para- 
meter Is very large (15.0) and the DLMIN parameter is set equal to zero. 
The other ISOCLS parameters used to implement this procedure are listed 
under Run 5 In Table 5.1. 
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Since the number of Input dots very nearly equals the number of 
clusters formed using LACIE parameters, 7, 14, 22, and 32 dots were In- 
put to ISOCLS. Table 5.2 presents a listing of the breakdown of the dots 
used for seeding. These breakdowns were chosen on the basis of expected 
variability within an Informational class. In other words^ more vari- 
ability Is expected In the hardwood or conifer stands scattered over the 
study area than Is expected In the water or barren categories, so more 
seeds were allotted to the forested categories. The actual seeds were 
chosen randomly; the hardwood seeds were selected at random from all 
Type 1 hardwood dots. Type 1 pixels were added to each category (hard- 
wood, grass, etc.) In order to Increase the number of seeds from 7 to 32. 


Table 5.2 Type 1 dots used to seed ISOCLS In each of the five Level II 
cover types (ISOCLS parameter study-VallecIto study area). 


Number 

of 

Seeds 

7 

14 

22 

32 


Identity of Seeds (Type 1 dots) 


Hardwood Conifer Grass Barren 


2 

4 

7 

10 


2 

4 

7 

10 


1 

3 

4 


1 

2 

3 


Water 

1 

1 

1 
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Seeded-10 Split Iterations: Realizing that seeding may save CPU 
tlme,^ the same dots (listed above) were Input Into the ISOCLS processor 
with the following parameters: PERCENT 100, STDMAX 3.0, all others de- 
fault. This mode of operation combined the time saving potential of seed- 
ing with the benefits of Iterative clustering. 

These three methods are compared In Figures 5.5 and 5.6. 

The results Indicate that (on the Vailed to study area): 

1. The LACIE parameters require the least CPU time to output a 


1. If ISOCLS Is not seeded, the first split Iteration produces only 2 
clusters, the second 4, the third 8 (at best), etc. Seeding estab- 
lishes cluster centers Immediately and removes the need for the Ini- 
tial self-start split Iterations. 







Figure 5.6 Relation between mmber of clusters formed and average accuracy of classification 
unseeded-lterative, seeded-LACIE, and seeded-iterative ISOCLS. 
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given number of clusters. 

2. Seeding does not necessarily save computer time when ISOCLS Is 
allowed to Iterate (10 split Iterations). 

3. Computer time was saved by Increasing the number of seeds 
from 7 to 14 to 22. The savings may have been due to the 
fact that fewer split Iterations were necessary to achieve 
stable clusters. 

4. None of the three methods produced average classification ac- 
curacies consistently higher than the other two. The average 
classification accuracies for all three methods ranged from 
78-87% when 14-32 spectral classes were output. 

5. The Iterative, seeded ISOCLS (7 seeds) formed 20 spectral 
classes and produced the best results. This particular 
parameter combination produced the only Individual cover type 
accuracies consistently above 70%. 

The accuracy of classification Is strongly Influenced by how well 
the statistical classes are labelled. Procedure 1 (of which ISOCLS Is 
part) uses an automatic labelling processor which has parameters sub- 
ject to analyst control. The effect of this processor on classification 
accuracy was Investigated, and the results are given In the next section. 

LABEL - Nearest Neighbor Influences 
The LABEL processor Identifies spectral classes ucing one of two 
methods: 1. K-Nearest Neighbor, 2. All-of-a-KInd (see pg 33for a 
description). The K-Nearest Neighbor process Identifies the spectral 
class only on the basis of numerical majority of the K-Mearest Neighbor 
dots. If K^S, the 5 dots closest to the given cluster mean are found. 

If three dots are hardwood, two are grass, the cluster is labelled hard- 
wood, even if the two grass dots are closer to the cluster mean. No 
weighting for proximity is done.^ 


1. Weighting by proximity could be accomplished by following these steps. 

1. Find the K-Nearest dots. 

2. Calculate the reciprocal of each of the K cluster mean-dot 
distances. 

3. Sum the reciprocals for each class. The identity of the spec- 
tral class is the category with the largest sum. 

For example, given that K equals 5 and the five closest dots have the 
following identities and associated cluster mean-dot distances (euclid- 
ean): 


Continued, bottom of next page. 
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The K-Nearest Neighbor LABEL processor was Investigated to try and 
determine the K which yielded the most accurate results. Two different 
statistics decks were used. The first deck contained 22 spectral 
classes formed using an unseeded. Iterative ISOCLS processor. The se* 
cond deck contained 21 spectral classes, formed by seeding ISOCLS (1 
split Iteration) with 22 dots. The results are shown In Figure 5.7. 

The same dotflle was usc.’ throughout the course of this parameter 
study. The file consisted of 83 Type 1 dots, 17 hardwood, 33 conifer, 

14 grass, 9 barren, and 10 water dots. 

Accuracies decreased markedly when 15*Nearest Neighbor was used to 
label the statistics decks. Ten-Nearest Neighbor yielded the highest 
(or nearly so) accuracies. The results Indicate that K should equal 
the number of Type 1 dots In the smallest (smallest In terms of num- 
ber of Type 1 dots) class. In this case, K approximately equal to 9 
would produce the best results. The reasoning behind this conclusion 
is twofold: 

1. Given an unknown population mean, the larger the sample size, 
the higher the probability that the sample mean adequately 
characterizes the population mean. In terms of this problem, 
the higher the K, the higher the probability that the spec- 
tral classes are correct1> labelled. 

2. Any K larger than the number of dots In the smallest class 
tends to 'confuse* LABEL when that class Is processed. ^ 


1. 


2. 


(con't from previous page) 


Category Distance 


Normally, this spectral class would be labelled 
hardwood, since three of the five closest dots 
are hardwood. 

If a weighted approach to labelling Is 
applied: 

Grass (weighted) > TjO 


Grass 1 .0 

Grass 1.5 

Hard 2.0 

Hard 2.5 

Hard 3.0 


Hard(welghted) 


1 

O 


1 

775 


1 

TTff 


1 .233, then the spec 


tral class takes on the Identity of the category with the largest 
sum. In this case. Grass. 

If there are m points In the smallest class, and K dots are used to 
label each spectral class, where m Is less than K, then at least 
K-m dots cannot be In that spectral class. Hence LABEL Is biased 
when m Is less than K. 



statistics decks 
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Based upon these results. It was decided that the second part 
of this study would use K-Nearest Neighbor labelling, with K equal to 
the number of Type 1 dots In the class with the smallest number of 
Type 1 dots. 

Comparison of the Hulticluster Blocks 

and Procedure 1 Approaches - Results 

This phase of the resecrch would have been greatly simplified If 
each of the runs had been made with the appropriate parameter levels. 
The ISOCLS parameter study done on the Vallecito study area Indicated 
clustering trends, buc It was found that this parameter study offered 
little predictive value when Devil Mountain quadrangle MSS data was 
processed. In Runs 1 and 3 (two of the four runs using ISOCLS), the 
study area or training blocks had to be clustered three times to ob- 
tain the desired number of clusters. Parameter levels which were ex- 
pected to yield about 15 spectral classes In a training block produced 
only seven. Parameters which would have output 20 to 25 spectral 
classes on the Vallecito study area produced only 13 on the Devil 
flountain quadrangle. The reduction In number of clusters formed on 
two different study sites (given the same ISOCLS parameters) may be due 
to the spectral complexity of the sites. The Vallecito 'i*ea Is rela- 
tively complex when compared with the Devil Mountain site which Is 
much more uniformly forested. Hence, an Iterative sequence was begun 
and parameters were manipulated until the desired number of spectral 
classes were obtained. Repetetive sequences also occured In the two 
Multicluster Block runs. The training block statistics were merged, 
often deemed unsatisfactory, and remerged. 

The analyst time spent and computer time used In each of the six 
runs (see Table 5.3) Include only that time spent for the final run. 

In other words, If an area was clustered three times (by ISOCLS) to 
produce 15 spectral classes, only that computer time used In the third 
try Is added to the total CPU figure. Likewise, If two merge sequences 
were necessary to produce an adequate final 'atlstics deck using a 
Multicluster Blocks approach, only the analyst time spent formulating 
the second deck Is summed Into analyst time used. Hence the CPU times 
and analyst's time noted apply to that hypothetical analyst familiar 
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processor. n^R&E. and SEPMUBILITT for the Nulticluster Block approaches. 

Number of pixels classified • 1687. 

Test pixels used; Conifer 932. Hardwood 457. Grass 252. Barren 46. Total 1687 (see Table 4.2). 
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with the McB approach and merging techniques and/or knowledgeable about 
the Intricacies of the ISOCLS processor and the data set. 

The main thrust of the research Involved comparing the Procedure 1 
method of classifying forestland Landsat MSS data to the McB approach 
to developing training statistics and classification using those sta- 
tistics. Below the results obtained for each of the six runs are dis- 
cussed. Four use a Procedure 1 approach to obtain the classification - 
Runs 1» 2, 5, and 6. Runs 3 and 4 use a Multicluster Block approach 
(see Tables 5.3 and 5.4). 

Table 5.4 Number of spectral classes In the statistics decks' used by 
the classifiers, by cover type, for each run. 


Run 

No. 



Spectral 

Classes Used 


Approach 

Con 

Hard 

Grass 

Barr 

Total 

1 

P-1 

ISOCLS 
unseed, Iter 

8 

6 

4 

2 

20 

2 

P-1 

CLUSTER 

11 

5 

3 

1 

20 

3 

McB 
ISOCLS 
unseed, Iter 

5 

5 

3 

2 

15 

4 

McB 

CLUSTER 

8 

4 

3 

3 

18 

5 

P-1 

ISOCLS 

seed,(LACIE) 

13 

4 

3 

2 

22 

6 

P-1 

ISOCLS 
seed, iter 

8 

6 

5 

1 

20 


Discussion 

of Results of 

Six 

Runs 



The results of each run are discussed below. The parameter levels 
used for individual processors (where Important) are given and any pro- 
blems encountered are noted. For a description of each run, refer to 
the Procedures section (pgs 45-48 ). 
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Run 1 

This Procedure 1 approach used the ISOCLS clustering processor In 
an Iterative mode (10 splits and one combine). Type 1 dots were not 
used to seed the processor. 

Twenty to twenty five spectral classes were desired on the Devil 
Mountain test slte.^ The area was clustered three times, the first two 
produced 13 and 18 spectral classes (STDMAX 2.6 and 2.2 respectively). 
The 13 spectral class statistics deck produced average and overall 
classification performances of 72.0 and 86. 5X respectively. These ac- 
curacies were 6.7 and 1.6% lower than the 20 spectral class statistics 

2 

deck generated when STDMAX was set to 2.0. Approximately nine CPU 
minutes were used In the first two attempts. 

The LABEL processor Identified 8 of the spectral classes as coni- 
ferous, 6 hardwood, 4 grass, and 2 barren. These labelled statistics 
were used by the classifiers; the results are given In Table 5.5. 

Compilation of the results from each of the six runs (see Table 
5.3} show that these statistics produced the best overall and average 
classification accuracies. 

Run 2 

Run 2 Is essentially the same as Run 1 except that the LARSYS 
CLUSTER processor was used (Instead of ISOCLS) with the EODLARSYS pro- 
cessors DOTDATA and LABEL to produce the training statistics. Twenty 
spectral classes were requested, and the entire quad clustered on an 
Interval of two. CLUSTER (CONV 98.5) took twice as much CPU time to 
produce the same number of clusters though only one clustering attempt 
was necessary (see Table 5.14). 

The labelled training statistics produced classification results 
lower than the first run's, though only the average classification ac- 
curacy was significantly lower (according to the Newman-Keuls Range 

1. Twenty to twentyfive spectral classes were deemed desirable based 
on results of the parameter study and on previous classification 
experience. 

2. Other parameter levels used to cluster the Devil Mountain quad- 
rangle (all Run 1 attempts): PERC 90, ISTOP 10, DLMIN 3.2, CLUS 
25, Number of Seed Dots 0. 


65 


Table 5.5 Classification results using Run 1 training statistics and 

a. the LARSYS Maximum Likelihood classifier, 

b. the EODLARSYS Sum-of-Normal-DensItles classifier 
on the Devil Mountain quadrangle. 


a. Maximum Likelihood classification results: 



No of 
Samps 

Perc 

Corct 

Number of Samples 

Classified Into 

Group 

Con 

Hard 

Grass 

Barr 

Con 

932 

98.3 

916 

15 

1 

0 

Hard 

457 

83.4 

45 

381 

30 

1 

Grass 

252 

64.3 

14 

50 

162 

26 

Barr 

46 

65.2 

7 

0 

9 

30 


Average Performance: (311.2/4) = 77.8 

Overall Performance: (1489/1687) = 88.3 


b. Sura-of- Normal -Den si ties classification results: 


Group 

No of 
Samps 

Perc 

Corct 

Number of Samples Classified Into 

Con 

Hard 

Grass 

Barr 

Con 

932 

98,0 

913 

18 

1 

0 

Hard 

457 

83.6 

43 

382 

31 

1 

Grass 

252 

63.5 

13 

47 

160 

32 

Barr 

46 

69.6 

6 

0 

8 

32 


Average Performance: (314.7/4) = 78.7 

Overall Performance: (1487/1687) = 88.1 
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Test, Table 5.12). LABEL found 11 conifer spectral classes, 5 hard- 
wood, 3 grass, and one barren class. The confusion tables produced by 
the classifiers using these training statistics are reproduced In Table 

5.6. 

Unlike Run 1, the barren class was Identified with a very low ac- 
curacy. Many barren test field points were classified as grass. Evi- 
dently the CLUSTER processor formed a mixed spectral class which was 
labelled as grass, a class which might have more accurately been la- 
belled barren. Procedure 1, without analyst participation, uses all 
spectral classes produced by the clustering processor. Mixed spectral 
classes (grass-barren mixtures) must be labelled and used. These 
classes would more than likely be deleted If encountered In a McB ap- 
proach. In defense of the P-1 approach, the LABEL processor (10-Near- 
est Neighbor) did better than anticipated, and In general produced 
statistics of higher quality (as judged by classification accuracy) 
than those produced by this analyst. 

Run 3 

Run 3 was the first of two Multicluster Blocks approaches Investi- 
gated In this study. This run used the ISOCLS clustering processor in 
an unseeded. Iterative mode to cluster three training blocks establish- 
ed on the Devil Mountain quadrangle. Between 14 and 16 spectral class- 
es were desired on each block. 

Again the problem of not being able to control the number of spec- 
tral classes output by ISOCLS plaqued this run. The number of spectral 
classes produced by ISOCLS Is very difficult to estimate If the pro- 
cessor Is not seeded and Is allowed to Iterate. Each block was clus- 
tered three times; the parameters used and the results are given below. 
Those parameters not mentioned were set to their default values. No 
seeding was done (no Type 1 dots Input). 

The data Is given first to provide more insight into the ISOCLS 
processor, second to document two oddities. As explained in the para- 
meter study section, one expects the number of spectral classes pro- 
duced by ISOCLS to increase as STDMAX and/or DLMIN is reduced. The re- 
sults from training block 1 (cluster attempts 1 and 2) show this is not 
always the case. Much mathematical handwaving concerning data 


Table 5.6 Classification results using Run 2 training statistics and 

a. the LARSYS Maximum Likelihood classifier, 

b. the EODLARSYS Sum-of-Normal -Densities classifier 
on the Devil Mountain quadrangle. 


a. Maximum Likelihood classification results: 


Number of Samples Classified Into 


Group 

IW OT 

Samps 

rerw 

Corct 

Con 

Hard 

Grass 

Barr 

Con 

932 

96.8 

902 

30 

0 

0 

Hard 

457 

80.7 

40 

369 

47 

1 

Grass 

252 

76.2 

16 

36 

192 

8 

Barr 

46 

41.3 

3 

0 

24 

19 


Average Performance: (295.0/4) = 73.8 

Overall Performance: (1482/1687) = 87.8 


b. Suro-of-Normal -Densities classification results; 



No of 
Samps 

Perc 

Corct 

Number of Samples 

Classified Into 

Group 

Con 

Hard 

Grass 

Barr 

Con 

932 

95.9 

894 

38 

0 

0 

Hard 

457 

79.9 

37 

365 

54 

1 

Grass 

252 

77.0 

12 

26 

194 

20 

Barr 

46 

45.7 

3 

0 

22 

21 


Average Performance; (298.5/4) * 74.6 

Overall Performance; (1474/1687) = 87.4 
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Table 5.7 Parameter values used to duster training blocks using 
ISOCLS, Run 3. 

Tr Cluster Par a me t y . J.ev el s ^ 

Block Attempt PERC DLMIN ISTOP STDMAX CLUS Pixels Used (min) Formed 


1 

1 

9D 

3.2 

ID 

2.8 

16 

841 

D.919 

7 


2. 

9D 

2.4 

ID 

2.D 

16 

841 

D.598 

6 


3^ 

1DD 

2.D 

ID 

1.7 

16 

841 

D.9D9 

15 

2 

1 

9D 

3.2 

ID 

2.8 

16 

18D4 

1.5D7 

9 


2 

9D 

2.6 

ID 

2.1 

16 

18D4 

1.283 

13 


3 

9D 

2.5 

ID 

2.D 

16 

18D4 

1.219 

15 

3 

1 

9D 

3.2 

ID 

2.8 

16 

1378 

1.452 

ID 


2 

9D 

2.8 

ID 

2.1 

16 

1378 

1.283 

13 


3 

9D 

2.7 

ID 

2.1 

16 

1378 

1.D21 

15 


+ PNIN dropped from 20 to 5. 

distribution In the training block and the ISOCLS parameter values 
could explain the phenomenon noted (scientific hindsight can be very 
accommodating). Suffice It to say that reducing STDMAX or OLMIN does 
not necessarily Increase the number of spectral classes produced. The 
second feature of Interest Is the CPU time used. Due to the character- 
istics of the IBM computing system, the CPU time required for a pro- 
cessor does not always indicate the actual computing time used for 
clustering. If the LARS computer is busy, it begins spending an Inor- 
dinate amount of time spooling programs in and out of memory core. 
Processing time can be increased by more than 5D% for exactly the same 
job; therefore the CPU time used depends (In part) on system demands. 
Hence the decrease In CPU time noted for all three training blocks as 
more spectral classes were produced may be an artifact of the computing 
system and the demands on that system at the time the jobs were run. 
Attempts were made throughout the study to minimize the effects of 
these computing inefficencles by running all the programs Involved in 
this study interactively between 6 pm and 1 am (when system demands are 
low) or by running jobs night batch. 

The fifteen spectral classes produced In each training block were 
identified using color infrared photography, the cluster map output 
from ISDCLS, and a Zoom Transfer Scope. The cluster classes output on 
each training block were very difficult to photo interpret. This 
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difficulty is reflected in the fact that it took over twice as long to 
photointerpret the training blocks; it took 2.5 hours to identify the 
spectral classes produced by ISOCLS, and about 1.1 hours to identify 
the CLUSTER (Run 4} spectral groupings. The ISOCLS spectral classes 
seemed confusing, more randomly distributed across cover types. Thir- 
teen spectral classes (out of 45} were identified as mixed classes and 
were deleted prior to the Merge process. Only 4 (out of 45) were de- 
leted from the statistics produced by CLUSTER in Run 4. 

The modified merge procedure described in the Procedures section 
was done twice. The first sequence produced a statistics deck deemed 
unsatisfactory because the results of the first merge sequence indi- 
cated that the analyst slighted the grass category in favor of hard- 
wood and barren. The same statistics were merged a second time and 
used by the classifiers; the confusion tables are presented in Table 
5.8. The final statistics deck used by the classifier contained 15 
spectral classes, 5 conifer, 5 hardwood, 3 grass, and 2 barren classes. 

Throughout this study, the quality of the training statistics was 
judged soley on the classification accuracies obtained. A Newman-Keuls 
Range Test placed the Run 3 training statistics in the lower half of 
the six studied. The lower performance of this Multicluster Block at- 
tempt was expected due (in part) to the inferior quality of the unla- 
belled training statistics output by ISOCLS using the parameters listed 
in Table 5.7. 

Run 4 

The training statistics output by CLUSTER proved to be much easier 
to photointerpret compared to those output by ISOCLS in Run 3. The re- 
sulting classification accuracies improved significantly. 

The three training blocks were clustered individually into 15 
spectral classes, taking 5.777 minutes of computer time. In general, 
ISOCLS could generate the same number of spectral classes as CLUSTER 
using much less CPU time. However, additional computer time was neces- 
sary in order to find the correct parameter levels so that ISOCLS could 
produce an acceptable number of clusters. The final result was that 
CLUSTER turned out to be the less expensive and less aggravating of the 
two clustering processors. 
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Table 5.8 Classification results using Run 3 training statistics and 

a. the LARSYS Maximum Likelihood classifier, 

b. the EODLARSYS Sum-of-Norraal -Densities classifier 
on the Devil Mountain quadrangle. 


a. Maximum Likelihood classification results: 


No of Perc *^^*"*^^*^ Samples Classified Into 


Group 

no OT 

Samps 

rerg 
Co ret 

Con 

Hard 

Grass 

Barr 

Con 

932 

95.0 

885 

47 

0 

0 

Hard 

457 

83.6 

27 

382 

47 

1 

Grass 

252 

46.8 

0 

131 

118 

3 

Barr 

46 

60.9 

0 

4 

14 

28 


Average Performance: (286.2/4) = 71.6 

Overall Performance: (1413/1687) = 83.8 


b. Sum-of-Normal -Densities classification results: 


No of 
Group Samps 


Perc 
Co ret 


Number of Samples Classified into 
Con Hard Grass Barr 


Con 

932 

95.1 

886 

46 

0 

0 

Hard 

457 

80.5 

30 

368 

58 

1 

Grass 

252 

50.0 

0 

122 

126 

4 

Barr 

46 

63.0 

0 

3 

14 

29 


Average Performance: (288.6/4) = 72.2 

Overall Performance: (1409/1687) = 83.5 
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Again the merging process was done twice since results of the 
first merge were Judged unsatisfactory (based on initial classification 
accuracies obtained for certain cover types, specifically hardwood or 
grass). The second attempt used the same training statistics; photo- 
interpretation was checked and the 45 spectral classes recombined. The 
final statistics deck contained 18 spectral classes, 8 conifer, 4 hard- 
wood, 3 grass, and 3 barren classes. The results are noted in Table 
5.9. 

Run 4 was the only run which gave statistically similar results as 
Run 1 in both accuracy categories (0.5 alpha level) (see Table 5.12). 

Run 5 

This run investigated the effects of the LACIE approach to the 
classification of forestlands. As such, I30CLS was seeded with 23 Type 
1 dots (10 conifer, 7 hardwood, 4 grass, and 2 barren dots). The para- 
meter levels used for clustering and labelling were those set by LACIE 
(and listed in Table 5.1, Run 5). LABEL identified the 22 spectral 
classes (13 conifer, 4 hardwood, 3 grass, and 2 barren classes) and 
these labelled statistics were used by the classifiers with the results 
given in Table 5.10. 

The Newman-Keuls Range Test showed that the Procedure 1 approach 
using LACIE parameters was significantly lower in both average and 
overall accuracy than any other run at the a = 0.05 level (see Table 
5.12). Of the four Procedure 1 approaches attempted, this one used 
the least CPU time, since LACIE parameters require the clustering 
processor to pass through the data only three times. The method used 
to label the statistics (K-Nearest-Neighbor, K=l) makes this procedure 
sensitive to mislabelled dots, f'ssibly reducing chances of correct 
spectral class identification. 

The LACIE parameters may work well in agricultural settings where 
a dot identified as wheat may be counted on to exhibit relatively uni- 
form reflectance values, no matter where it is located in the segment. 
Extensive heterogeneity is the rule in the forests of the San Juan 
Mountains; this characteristic is perhaps the prime reason behind the 
poor showing of the P-1 (LACIE) approach for classifying some 'overtypes 
in this forested scene. 
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Table 5.9 Classification results using Run 4 training statistics and 

a. the LARSYS Maximum Likelihood classifier, 

b. the EODLARSYS Sum-of*Norma1 -Densities classifier 
on the Devil Mountain quadrangle. 


a. Maximum Likelihood classification results: 


Number of Samples Classified Into 


Group 

Samps 

Corct 

Con 

Hard 

Grass 

Barr 

Con 

932 

97.7 

911 

20 

1 

0 

Hard 

457 

75.1 

40 

343 

72 

2 

Grass 

252 

69.8 

0 

62 

176 

14 

Barr 

46 

58.7 

0 

0 

19 

27 


Average ?erformance; (301.3/4) « 75.3 

Overall Performance: (1457/1687) » 86.4 


b. Sum-of-Normal-Dens1t1es classification results: 



No of 
Samps 

Perc 

Corct 

Number of Samples 

Cla-i 

Ifled Into 

Group 

Con 

Hard 

Grass 

Barr 

Con 

932 

97.2 

906 

23 

3 

0 

Hard 

457 

74.4 

37 

340 

78 

2 

Grass 

252 

73.8 

0 

53 

186 

13 

Barr 

46 

58.7 

0 

0 

19 

27 


Average Performance: (304.1/4) ■ 76.0 

Overall Performance: (1459/1687) ■» 86.5 
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Tables. 10 Classification results using Run 5 training statistics and 

a. the LARSYS Maximum Likelihood classifier, 

b. the EODLARSYS Sum-of-Normal -Densities classifier 
on the Devil Mountain quadrangle. 


a. Maximum Likelihood classification results: 


No of Perc l^**"*^*^'^ 0^ Samples Classified Into 


Group 

Samps 

Corct 

Con 

Hard 

Grass 

Barr 

Con 

932 

99.4 

926 

6 

0 

0 

Hard 

457 

69.1 

69 

316 

68 

4 

Grass 

252 

33.7 

16 

52 

85 

99 

Barr 

46 

45.7 

6 

0 

19 

21 


Average Performance: (247.9/4) ■ 62.0 

Overall Performance: (1348/1687) ■ 79.9 


b. Sum-of-Normal-DensIties classification results: 



No of 
Samps 

Perc 

Corct 

Number of Samples 

Classified into 

Group 

Con 

Hard 

Grass 

Barr 

Con 

932 

99.4 

926 

6 

0 

0 

Hard 

457 

70.0 

61 

320 

71 

5 

Grass 

252 

30.2 

15 

45 

76 

116 

Barr 

46 

45.7 

6 

0 

19 

21 


Average Performance: (245.3/4) ■ 61.3 

Overall Performance; (1343/1687) ■ 79.6 
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Run 6 

The final run attempted to combine the "best” features of Runs 1 
and 5, I.e., the capability of ISOCLS to accept Initial cluster centers 
and Its controlled Iteration capabilities. The processor was seeded 
with the same dots used In Run S, and ISTOP was Increased to three (fOur 
passes through the data). Labelling was done using the 10-Nearest- 
Nelghbor approach. The extra Iterations minimally Increased CPU time. 

The Iterations combined with the Increased number of dots used to label 
the spectral classes resulted In an eight to ten percent Increase In 
classification accuracy over the LACIE approach (Run S). 

Twenty spectral classes were output by ISOCLS « 8 were labelled 
conifer. 6 hardwood. 5 grass, and 1 barren. See Table 5.10 for the 
cross-classification results. 

The barren cateogry was classified very poorly, the lowest of any 
of the six runs In part because the category was characterized by only 
one spectral class. In spite of this. Run 6 results ranked In the 
upper half of the six runs In overall accuracy, and was not statistically 
different from Run 1 (see Table 5.12). However. It's capabilities are 
more clearly shown by evaluating the average classification accuracy, 
where It ranked In the lower third. 

Analysis of the Results of the Six Runs 
Performance and Test Fields 

The test fields used to evaluate the runs were conscientiously 
selected to represent the entire range of stand conditions. In other 
wordr. a field was considered forested If more than 30t of the area was 
tree crown. A forested test field was Judged to be hardwood If more 
than 50% of the crowns present were hardwood. Forested test fields 
were selected In pure stands and In mixed, hence hardwood-conifer cross- 
classification was expected. Coniferous test fields were rarely mls- 
classlfled. but hardwood fields were often misclassifled Into conifers 
or grass. The conifer- hardwood mixup may be due to the attributes of 
the hardwood test fields (I.e.. many hardwood test fields had high 
percentages of conifer In them). 

The hardwood-grass cross-classification was most likely due to the 
spectral similarity of the two classes. The difference, between the two 


Table 5.11 Classification results using Run 6 training statistics and 

a. the LAKSYS Naxlmuin Likelihood classifier, 

b. the EODLARSYS Sum-of-Normal-DensItles classifier 
on the Devil Mountain quadrangle. 


a. Maximum Likelihood classification results: 


No of Perc **^*"*>*>* of Samples Classified Into 


Group 

Samps 

Corct 

Con 

Hard 

Grass 

Barr 

Con 

932 

99.0 

923 

8 

1 

0 

Hard 

457 

81.2 

51 

371 

34 

1 

Grass 

252 

64.7 

8 

79 

163 

1 

Barr 

46 

39.1 

7 

0 

21 

18 


Average Performance; (284.0/4) ■ 71.0 

Overall Performance; (1475/1687) ■ 87.4 


b. Sum-of-Normal-DensItles classification results; 


No of Perc dumber of Samples Classified Into 


Group 

Samps 

Corct 

Con 

Hard 

Grass 

Barr 

Con 

932 

98.9 

922 

9 

1 

0 

Hard 

457 

81.4 

50 

372 

?4 

1 

Grass 

252 

65.1 

8 

79 

164 

1 

Barr 

46 

39.1 

7 

0 

21 

18 


Average Performance: (284.5/4) ■ 71.1 

Overall Performance: (1476/1687) ■ 87.5 
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might be more pronounced at a later date* but In early June In the 
southern Rockies, the new hardwood flush has a high Infrared reflectance, 
much like that of the grasslands. The statistics for Runs 3 and 4 were 
remerged due to this he rdwood- grass mixup. Hardwood accuracies were In- 
creased only at the expense of grass, and vice-versa. This cross-classi- 
fication plagued all six runs, but was most noticeable In Runs 3 and 4, 
the McB approaches. 

The barren class had, of the four considered, the worst classifi- 
cation accuracy over the six runs. For the most part. It was generally 
confused with grassland, though cross-classification with conifer oc- 
curred regularly, but less often. The barren cover type makes up only a 
small percentage of the Devil Mountain quad, and occurs In Irregular 
patches. It occurs most often at the tops of peaks (and tirades Into 
grassland-tundra) and comprises a small section of the Pledra River 
valley. In union with sparse stands of Ponderosa Pine (less than 30% 
crown closure). Hence the barren-grassland cross-classification may be 
the result of the test fields' edge effects. Only the four P-1 
approaches classified barren test field pixels Into the conifer class. 
Evidently, LABEL was not able to distinguish the unlabclled sparse coni- 
fer-barren spectral classes as barren classes since there were not enough 
Type 1 dots exhibiting similar attributes to constitute a majority. 

These classes were successfully Identified by the analyst using the McB 
appraoch. 

A Comparison of the Methods of Developing Training Statistics 

Newman-Keuls Range tests were run on the oversll and average 
accuracies obtained for each of the six runs. The procedure Involved 
steps outlines on page 2.7-11 of Landgrebe (1976). 

Procedure 1 approaches are responsible for the highest and lowest 
classification performances, indicating that the P-1 approach carries 
much potential if used properly. The P-1 approach using LACIE parameters 
are definitely not the proper way to classify a forested scene. 

Runs 1 and 4 are not statistically different at the 0.05 alpha 
level, and are the two best approaches to developing training statistics 
for forestland classification. The use of either of these two methods 
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would depend upon the size to the area being classified and the ancillary 
Information available to the analyst. 


Table 5.12 Newman-Keuls Range Tests of average and overall classifi- 
cation accuracies for the six runs (Devil Mountain study site). 

Note; Runs sharing a common line are not significantly different at 
the 0.05 alpha level. 

The maximum likelihood classification accuracies were used In 
this analysis. 

a. Average Classification Accuracy: 

Run No. 1 4 2 3 6 5 

P-1 McB P-1 McB P-1 P-1 

ISOCLS CLUSTER CLUSTER ISOCLS ISOCLS ISOCLS 

unseed. Iter unseed, Iter seed, Iter seed,LACIE 


Accuracy: 77.8 75.3 73.8 71.6 71.0 62.0 


b. Overall Classification Accuracy: 


Run No. 1 2 6 

P-1 P-1 P-1 

ISOCLS Cluster isocls 
unseed.lter seed. Iter 

88.3 87.8 87.4 


4 3 5 

McB McB P-1 

cluster isocls isocls 

unseed, Iter seed,LACIE 

86.4 83.8 79.9 


The Procedure 1 approach using ISOCLS as an unseeded. Iterative 
processor would be best where forest Inventory Information is available 
and can be located In the Landsat grid. For instance, this approach 
would be ideal in Minnesota where the USFS has established a grid system 
of photointerpreted and ground checked points. The field points have 
been located on a Universal Transverse Mercator rectangular grid coordi- 
nate system. The UTM coordinates are easily converted to Landsat coor- 
dinates using a simple Fortran program. These points of known identity 
have been established in forest, agricultural, and urban areas, and on 
water. In some instances however, certain cover types will not be 
adequately represented by the inventory information supplied. In such 
cases, the analyst must locate and Identify additional pixels in cover- 
types lacking adequate representation. Once this inventory information 
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Is Input (requiring little analyst interaction), the classification 
process is completely machine-oriented. The disadvantage to the P-1 
procedure is that the entire area of interest must be clustered in order 
to develop the unlabelled training statistics. Costs may be kept reason 
able by clustering large areas on an interval, i.e., cluster a repre- 
sentative sample of the data. The P-1 approach has potential for an 
automated, operational forestlands classification system, however, the 
software would have to be streamlined to be cost effective. 

The Multicluster Block approach (using CLUSTER) to developing the 
training statistics becomes more attractive as the size of the study 
area increases. Savings in computer time, though noticeable, were not 
great in this study since the entire Devil Mountain quad was not large 
when compared to the training blocks selected.^ The McB approach is 
also suited to areas where ancillary Information is scarce. Forest 
inventory information may be an aid in identifying the spectral classes 
in each block, but quality airphotos or a type map are more useful. 
Following is a cost evaluation of the time used to develop the 

training statistics for the Devil Mountain quad for each of the six 
2 

runs. The evaluation is based on a $5.t0 charge for a computing 
minute ($300. 00/hour) and $10.00 per hour for analyst cost. The first 
co't column assumes the analyst had to locate and identify his own 
Type 1 dots for the P-1 approaches (as was the case in this study). 

The second cost column assumes forest inventory information is avail- 
able and the analyst spends one hour reformatting this information to 
make it available to the DOTDATA processor. 

The cost evaluation is really relevant only to those who might 
wish to institute an operational system using the current software, at 
best highly unlikely. However, the point can be made that the P-1 
approaches are cheaper than the McB approaches when forest inventory 

1. The McB approach used 12.1% of the available data on the Devil 
Mountain quadrangle. Procedure 1, clustering on an interval of 2, 
used 25.2% of the available data. 

2. In the case of Runs 1, 2, 5, and 6, the P-1 runs, the processors in- 
volved are DOTDATA, ISOCLS or CLUSTER, and LABEL. In the McB 
approaches. Runs 3 and 4, ISOCLS or CLUSTER, MERGE, and SEPARABILITY 
CPU times, are summed. 
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Table 5.13 Cost of developing training statistics for the Devil 
Mountain quadrangle 


Run 

Approach 

Cluster 

Function 

Analyst 

Time 

(hours) 

1 

P-1 

ISOCLS 
unseed, iter 

5 

2 

P-1 

CLUSTER 

5 

3 

McB 

ISOCLS 
unseed, iter 

4.5 

4 

McB 

CLUSTER 

2.6 

5 

P-1 

ISOCLS 

seed,LACIE 

5 

6 

P-1 

ISOCLS 
seed, iter 

5 


Computer 

Time 

(min) 

Total 

Cost ($) 

Estab Dots 

Inven Avail 

7.403 

87.02 

47.02 

13.502 

117.51 

77.51 

4.119 

65.60 

65.60 

7.330 

62.65 

62.65 

4.217 

71.09 

31.09 

4.693 

73.47 

33.47 


information is available. Inventory information reduces analyst involve- 
ment in the P-1 process markedly, whereas the inventory information 
really has no effect at all on the amount of time spent by the analyst 
involved in a Multicluster Block approach. This point Is important when 
considering future classification systems. As computer technology grows 
and softvwre is refined, the amount of CPU time used in a given process 
will be negligible when compared with the analyst expenses. The McB 
approach is analyst intensive, an analyst must be involved in developing 
the training statistics. P-1 makes no such demands (or minimal demands) 
if inventory information is available. As such, P-1 may be the most 
cost-effective approach to developing training statistics. Table 5.13 
also suggests that seeding the ISOCLS processor may be the cheapest 
method of developing the statistics. This study only scratched the 
surface of the effects of seeding and number of clustering iterations on 
the accuracy of classification. The seeded approach may yet prove to be 
the most accurate and cost effective method of classifying forested lands 
using available forest inventory data. 

The Classifiers 

The training statistics developed in each of the six runs were used 
by two different classification procedures, the EODLARSYS Sum-of-Normal - 
Densities classifier (CLASSIFY) and the LARSYS maximum likelihood 
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classifier (CLASSIFYPOINTS). Both are perpoint processors; both are 
more thoroughly described in the Materials section. 

Paired T-tests were run on the transformed accuracies (arcsin /p, 
p * average or overall accuracy) to determine statistically significant 
differences between the classifiers. No significant difference was 
found, even at the 0.10 alpha level. 

Table 5.14 Differences in average and overall accuracies between the 
two classifiers for the six runs. Devil Mountain quadrangle. 

Note: The letters in parentheses indicate the classifier which 

produced the higher accuracy. SoND - Sum-of-Normal -Densities 

ML - Maximum-Likelihood 

Note: Paired-T statistical analysis used transformed data; the 

numbers below have not been transformed. 


Average Accuracy Overall Accuracy 


Run 

SoND 

ML 

Di f ference 

SoND 

ML 

Di f ference 

1 

78.7 

77.8 

0.9 (SoND) 

88.1 

88.3 

0.2 

(ML) 

2 

74.6 

73.8 

0.8 (SoND) 

87.4 

87.8 

0.4 

(ML) 

3 

72.2 

71.6 

0.6 (SoND) 

83.5 

83.8 

0.3 

(ML) 

4 

76.0 

75.3 

0.7 (SoND) 

86.5 

86.4 

0.1 

(SoND) 

5 

61.3 

62.0 

0.7 (ML) 

79.6 

79.9 

0.3 

(ML) 

6 

71.1 

71.0 

0.1 (SoND) 

87.5 

87.4 

0.1 

(SoND) 


In general, the maximum- likelihood classifier produced higher overall 
accuracies, and the SoND classifier produced better average accuracies 
(again, statistically indistinguishable). 

A study by Scholz, Fuhs, and Hixson, 1979, compared five classifi- 
cation processors, among them the two used in this study. They found 
that the Sum-of-Normal-Densities classifier took up to five times as 
much computer time as the maximum-likelihood classifier. In each of 
the six runs in this study, the classifiers categorized 1687 pixels 
using all four channels of single acquisition Landsat data. A paired- 
T test was run to determine if there were significant differences in 
CPU time used between the classifiers. No differences were found 
'^calculated ~ ^.29). In fact, in five of the six runs, the 
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maxlmutn-likellhood classifier took longer, though CPU time differences 

only Involved magnitudes of tenths of a minute (see Table 5.3). 

Scholz, Fuhs, and Hixson, 1979, concluded that: 

"the key to accurate classifications Is In the development 
of the training statistics for the classification rather than 
In the classifier Itself." 

This research confirms their final conclusions, but disagrees with the 
classification time element. Perhaps differences would be more notice- 
able If more pixels were classified (such as the 5 x 6 mile segment - 
22,932 pixels - used In the study mentioned). 

The Clustering Processors 

The processors used to develop the training statistics In Runs 1 
and 2 (P-1 approach) and In Runs 3 and 4 (McB approach) were Identical 
except for the clustering processors. Differences In computer time and 
in classification accuracy then are a direct result of differences In 
ISOCLS and CLUSTER. The parameters used In each of the runs are de- 
tailed In the description of the Run results. 

Table 5.15 indicates that it taxes CLUSTER approximately twice as 
long as ISOCLS to output a given number of spectral classes. 

Table 5.15 Comparison of CPU time used to develop unlabelled training 


statistics 

using the 

ISOCLS and 

CLUSTER processors. 



Area 

No. Pixels 

Time (min) used 

No. spectral 

Approach 

Clustered 

Clustered 

ISOCLS 

CLUSTER 

classes 

P-1 

Dev Mt Qd 

8372 

6.04 

12.09 

20 


Tr B1 1 

841 

0.91 

0.90 

15 

McB 

Tr B1 2 

1804 

1.22 

2.86 

15 


Tr B1 3 

1378 

1.02 

2.02 

15 

McB( total ) 

4023 

3.15 

5.78 

45 


More important however is an analysis of the accuracies (see Runs 
1, 2, 3, and 4, average and overall accuracies. Table 5.3). Average ac- 
curacies noticeably declined (approximately 4%) when CLUSTER was used in- 
stead of ISOCLS in the P-1 approach. The overall accuracy was minimally 
affected. On the other hand, McB accuracies using statistics generated 
by CLUSTER (Run 4) were noticeablely higher (47, average, 37, overall) than 
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the ISOCLS (Run 3) results. The fact that the Run 3 accuracies were 
lower than Run 4's was no suprise since identification of the ISOCLS 
training block spectral classes was very difficult. 

The results of the cluster processors comparison is best summed up 
by listing the advantages and disadvantages of each. 

ISOCLS 

Disadvantages: . 

1. Cannot specify number of spectral classes output. 

2. Intricate parameter controls require much analyst experience 
if ISOCLS is to be used properly. 

3. The spectral classes output were difficult to identify using 
photointerpretive methods; the 'quality' of the spectral 
classes seemed low. 

Advantages: 

1. Can accept beginning cluster means which reduces CPU time. 

2. In general, requires less CPU time than CLUSTER to output 
a given number of spectral classes even when not seeded. 

CLUSTER 

Disadvantages; 

1. Cannot accept initial cluster means (a characteristic of the 
LARS program only). 

2. Requires more CPU time. 

Advantages : 

1. Much easier to use, parameters simple. 

2. Number of spectral classes requested equals the number out- 
put. 

In summary, do not mix software clustering processors. ISOCLS 
seems best suited to a P-1 approach, CLUSTER to a McB approach. 
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CHAPTER 6 - CONCLUSIONS AND RECOMMENDATIONS 

Six different methods Involving two general approaches were tested 
to determine the best method of developing training statistics. Experi- 
mental design allowed comparison of the two classification processors 
and the two clustering processors Involved In the study. 

Conclusions 

The Procedure 1 approach to developing training statistics Involves 
the use of the DOTDATA, ISOCLS and LABEL processors. The statistics 
output by these EODLARSYS programs showed great promise for accurately 
classifying forested areas. The Procedure 1 approach to developing 
training statistics Is extremely versatile. Only four different para- 
meter sets were tested. Conclusions based on this research may be 
drawn : 

1. The P-1 approach using ISOCLS In an unseeded. Iterative mode 
and the Multicluster Blocks approach using CLUSTER were the 
two best approaches to developing training statistics. The 
use of either method would basically depend upon the type 
and amount of ancillary Information available to the 
analyst. 

2. The P-1 approach makes maximum use of current forest inven- 
tory Information. Availability of this Information mini- 
mizes the analyst Involvement In the classification proced- 
ure. The reduction In analyst Involvement Is critical be- 
cause a. In an operational mode, analyst time would most 
likely be the biggest single cost factor and b. It reduces 
the largest source of bias. 

P-1 has 1t*s restrictions. First the dotflle must contain a suf- 
ficent number of pixels In each cover type of interest and should cover 
the range of variability within a cover type. Forest inventory data 
which provides only information on forested plots may be used, but ad- 
ditional dots must be identified and located In all of the nonforest 
cover types (urban, grassland, barren, and water). The accurate loca- 
tion and Identification of all dots Is critical, for mislabelled or 
poorly located dots may lead to spectral classes incorrectly Identified. 
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3. LACIE parameters should not be used to classify forested 
areas. The essentially noniterative ISOCLS clustering pro- 
cessor and K-Nearest Neighbor (K>1) method of labelling the 
ISOCLS statistics do not produce consistent results In a 
heterogeneous area. 

4. Seeding ISOCLS with Type 1 dots reduces CPU time, and shows 
promise for classifying forested areas. Preliminary research 
Indicates that each cover type should have at least two 

seed dots to better Insure that the smaller cover types are 
spectrally represented. 

5. When K-Nearest Neighbor Is used to label the spectral classes, 
K should equal that number of dots In the covertype with the 
smallest dot representation In the dotflle. 

The Multicluster Blocks approach to developing training statistics 
produced classification results on par (statistically Indistinguishable) 
with the best P-1 resuUs. The McB approach requires a great deal of 
analyst Interaction which, under certain circumstances. Is desireable. 
Experience has shown that P-l's automated labelling processor often 
misses critical cover types that make up only a small percentage of the 
study area. Also, naturally, LABEL cannot abide by arbitrary defini- 
tions unless those definitions are exemplified In the dotflle. Crown 
density differences (for Instance, between barren and conifer, where 
crown densities less than are considered nonforested) may lead to 
mislabelling. These problems are overcome using the McB approach since 
spectral class labelling is done by the analyst. The McB approach is 
also efficent when developing training statistics for large study areas. 
P-1 requires the entire study area be clustered, and this can become 
expensive when the area involved reaches into the hundreds of thousands 
of acres, unless the area is clustered on an interval of 2, 3, or 4 
(i.e., sample one fourth, one ninth, or one sixteenth of the pixels). 

The clustering of representative training blocks (Multicluster Blocks 
approach) may be more realistic. 


The method used to develop the training statistics most certainly 


influenced the classification more than the classifiers used. This 


study found that; 


1. 


The differences in classification accuracies 
EOOLARSYS Sum-of-Normal-Densities classifier 
maximum-likelihood classifier were minimal, 
evaluation showed no significant differences 
classifiers. 


between the 
and the LARSYS 
Statistical 
between the two 
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2. The CPU times used by the classifiers to classify 1687 
pixels were not significantly different. On the average, 
the maximum-likelihood classifier took more CPU time than 
the SoNO classifier. 

The clustering processor used to develop the training statistics 
significantly affected the classification accuracies. Sweeping con- 
clusions cannot be made due to the versatility of the ISOCLS processor. 
The six runs indicated: 

1. That P-1 approaches should use the ISOCLS processor. 

2. A Multicluster Blocks approach yields better results when 
the CLUSTER processor is used. 

3. CLUSTER takes approximately twice the CPU time to output 
a given number of spectral classes. 

These conclusions should be viewed in the light that future ISOCLS seed- 
ing and iteration studies may further refine the ISOCLS processor cap- 
abilities. CLUSTER grouped the training block data into easily identi- 
fiable spectral classes, the ISOCLS classes were very difficult to 
identify. Seeding and/or parameter changes would affect, and may im- 
prove, the quality of the clusters output by ISOCLS, making it useful 
to the NcB approach. 

The performance of the ISOCLS processor (number of clusters out- 
put, variability within clusters, and CPU time used) is affected by: 

1. the parameter levels used; 

2. the spectral variability of the study area; 

3. the size of the study area. 

Following are lists of parameters that might be used on a forested area 
by an analyst new to the rigors of Procedure 1 and the ISOCLS processor. 
The parameters provide a handhold, a starting point for the novice P-1 
analyst. The parameters given presume that approximately 8500 pixels 
(4 channel) are processed and the study area is spectrally heterogen- 
eous. Between 20 and 25 spectral classes are desired. 

Unseeded: No Type 1 dots input as Initial cluster means. 


STDMAX 

1.8 

SEQUEN 

SC 

PERC 

90 

DLMIN 

3.0 

I STOP 

10 

CLUS 

25 


all other parameters default. 
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If these parameters yield less than the desired number of spectral 
classes, reduce STDMAX and DLMIN (In 0.2 Increments) and/or Increase 
PERC to 100. 

If seeds are Input, the following parameters might be used. 

Seeded: Use 20-25 Type 1 dots If 20-25 spectral classes are desired. 
The percentage of dots In each cover type should be roughly proportional 
to the percent composition of the study area (I.e., If 30% of the study 
area Is hardwood forest, then 30% of the seeds should be hardwood dots). 
Each cover type should have at least two dots Input to the ISOCLS pro- 


cessor. 

STDMAX 

2.8 

SEQUEN 

SC 


PERC 

90 

DLMIN 

3.0 


ISTOP 

3 

CLUS 

25 


all other parameters default. 

If these parameters yield less than the desired number spectral 
classes, follow the Instructions above and/or Increase ISTOP In Incre- 
ments of 2. 

Again, the parameters given above are starting points and may not 
produce the results expected. ISOCLS parameters must be manipulated In 
order to produce an adequate set of training statistics (a sure thing, 
right up there with death and taxes). 

As Schubert (1978) stated, the ultimate foreseeable goal In Landsat 
data processing is the development of a fully automatic procedure which 
would allow a reproducible classification of a given scene. Procedure 1 
Is a viable method of classifying forested areas. It Is Ideally suited 
to utilizing forest Inventory data directly, as such it Con conceivably 
be modified Into an automated classification system. Many questions re- 
main, some of which are listed in the next section, but an automated 
forest inventory system is feasible and may attract interest from the 
private sector. 

Research Recommendations 

Researchers rarely find themselves at a dead end. The answer to 
one question gives rise to others. This research has raised points that 
might be of interest to others, especially those concerned with classi- 
fication procedures development. 
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The LABEL processor Is very Important In the development of P-1 
training statlsitics and may be the weakest link In the classification 
process. The Nearest-Neighbor concept Is Just one method of labelling 
the spectral classes. Pore et a1. (1978) reported that All-of-a-KInd 
labelling (explained, pg33) Improved classification accuracy In agri- 
cultural areas. This labelling procedure was used on the Vail eel to 
study area. Every cluster evidently had more than one kind of Type 1 
dot Included, for each defaulted to K-Nearest Neighbor. It Is recom- 
mended that future research address the following questions. 

1. Due to the heterogeneity of the for>ested tracts, would 
majority rule (Instead of unanlmot;. rule) better suit forest- 
land statistics labelling (1*e., check all the Type 1 dots 

In a cluster and label that cluster the Identity of the ma- 
jority of the dots' )? 

2. If K-Nearest Neighbor labelling Is used, does weighting for 
proximity to the mean (explained, pg58, footnote) have any 
merit? 

A number of questions remain to be answered about ISOCLS. The 
ISOCLS clustering processor may be seeded with Type 1 dots or with 
spectral values Input by the analyst. The effects of seeding will vary 
from study area to study area, however the general effects of the pro- 
portions (number of Type 1 dots Input in each cover type) and numbers of 
dots Input to ISOCLS remain unresolved. It Is recommended that the 
following questions be researched: 

1. What are the effects of number of split Iterations, number 
of seeds used, and propov'tlons of Type 1 dots used on CPU 
time requirements and classification accuracies? 

2. Does a proposed technique which might be called dotflle- 
cluster mean seeding hold any promise? The proposed tech- 
nique Is reviewed below. 

The dots used to seed ISOCLS should be representative of the area being 
clustered so that the 'natural' data grouping In the study area can be 
readily discerned. Inputting Type 1 dots randomly by cover type may 
not truly represent the spectral groupings present In the data. 


1. Ties could be handled In one of two ways: 

a. Default to K-Nearest Neighbor. 

b. Discard dots furthest from the cluster mean until the tie 
Is broken. 
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especially If Type 1 dot Identification errors exist. One possible 
method of obtaining more truly representative Initial cluster centers 
may be to 

a. compile the dotflle Information (I.e., locate and Identify 
the Type 1 dots on the study area); 

b. cluster the dotflle Into an adequate number of spectral 
classes (adequate meaning perhaps twice the number of major 
cover type groupings on the study area); 

c. label the spectral classes using the original dotflle and the 
LABEL processor; 

d. use the spectral class means of the dotflle to seed ISOCLS. 

If the dotflle Is large enough (so that most clusters contain 30 or more 
points) and truly characterizes the study area« ISOCLS might best be 
used as a grouping processor (LACIE parameters). The effects of Iter- 
ations on the quality of the training statistics output using this 
method should be documented. 

One of the disadvantages of the Procedure 1 approach Is that the 
entire area, or a systematic subsample of the aret, must be clustered in 
order to develop the unlabelled training statistics. A Monocluster 
Blocks approach might better utilize computer time while minimizing 
analyst Interaction. Essentially, instead of developing statistics over 
the entire area, the analyst would select a few training blocks (1600 
to 3600 pixels) in heterogeneous areas characteristic of the entire 
study area. All of the blocks would be clustered simultaneously, the 
statistics labelled (using LABEL) and the area classified. This ap- 
proach may combine the CPU time-saving capability of Monocluster Blocks 
with the automate 1 processing advantages of Procedure 1 while demanding 
only minimal, additional analyst Input. It Is therefore recommended 
that research be conducted to explore the ca 'abilities of a Monocluster 
Blocks-Procedure 1 approach. 

The Multicluster Block approach has been tested by two analysts 
using the same data set and results are comparable. The McB approach Is 
relatively simple procedure to learn, but does require some analyst 
experience In order to use It effectively. No future research Is neces- 
sary to further document this approach. It Is suggested that this ap- 
proach might be used In future research as a standard against which 
alternate procedures are compared. 
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Final Statement 

The accuracies of classification of the nonforest cover types ob- 
tained during this research might be uiiacceptable In an operational 
system. Studies have shown that multi date overlays can substantially 
Improve performance. Overlaying data sets may become cost effective. In 
fact, a standard procedure, whei NASA Implements software which geometri- 
cally corrects Landsat data sets prior to sale. 

Improved spatial and spectral resolutions and an Increased number of 
spectral bands (In the visible, reflective, and emissive Infrared) will 
undoubtedly Improve classification performances . This satellite plat- 
form Imagery may be used by automated classification systems which mesh 
the multi spectrcl Information with forest Inventory Information. The 
Procedure 1 approach Is just one method by which this might be accom- 
plished. 

The world's population Is currently growing at the rate of 80 
million people per year. It has been predicted that by the year 2000 
the world's population will have doubled to over 7 billion people. 
Availability and location of resources taks on a new Importance. Satel- 
lite platform Imagery provides the resource analyst synoptic Information 
In a quantitative format directly suited to computer analysis. The 
marriage of satellite and Inventory Information (be It forest or agri- 
cultural Information) In conjunction with automated-computer-alded- 
analysls techniques offer the possibility of monitoring resources on 
time schedules on the order of days. Such accurate and timely Informa- 
tion may someday be necessary as the world's agricultural areas and 
forestlands are pushed to their productive limits under environmental 
conditions less than Ideal. 

"Remote sensing from resource satellites provides mankind for 
the first time witn a potential capability for worldwide re- 
source mapping and environmental monitoring In near-real time. 

The Importance of such readily available and objective Infor- 
mation about the distribution, quality and exploitation of 
natural resources can hardly be overemphasized. Nan has become 
accustomed to the uncontrolled use of the envlronntent and has 
taken for granted the self-renewing and sel f-correcting ability 
of nature and the apparent abundance of natural resources. 

This misconception Is coming to an end and v/e are now beginning 
to experience the first adverse effects of a deteriorating 
environment ..." 


Kalensky and Nightman (1976). 
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